Chapter 1

Communication Concepts: Circuit Designer Perspective


1.1     Introduction 

What  does  GSM  stand  for?  Why  is  the  dynamic  range  in  a  digital  enhanced  cordless 
telecommunication  receiver  specified  to  be  around  93  dB?  What  are  multipath 
fading  and  Doppler  shifts?  How  would  the  noise  figure  requirement  of  a  low-noise 
amplifier  change  if  I  change  my  radio  from  one  standard  (DECT)  to  another  (GSM)? 
Would  Doppler  shift  be  a  problem  in  a  standard  such  as  PHS,  and  what  circuit 
techniques  do  I  have  to  address  this  problem? 

These  are  all  the  types  of  questions  integrated  circuit  (IC)  designers  may  ask  when 
dealing  with  circuit  design  for  mobile  wireless  communication.  Unfortunately, 
IC  designers  must  be  familiar  with  diverse  sources  of  knowledge,  and  each  of  these 
sources  is  written  for  a  specific  circle  of  engineers,  so  finding  and  integrating  relevant 
knowledge  across  these  disciplines  proves  difficult.  For  example,  if  you  need  to 
understand  modulation  techniques  (such  as  quadrature  phase  shift  keying  (QPSK)) 
and  you  consult  a  standard  communication  text,  you  would  find  that  the  design 
trade-off  is  typically  presented  using  channel  capacity  and  the  demodulator's 
signal-to-noise ratio (SNR) as key parameters. This is of little use to a circuit
designer, who is more interested in the SNR of the whole receiver than in that of
the demodulator itself. Nor is channel capacity a relevant parameter, since circuit
designers have no control over it. On the other hand, if you go to a classical radio
frequency (RF) circuit text, you will be exposed to figures of merit such as the input
third-order intercept point (IIP3) of individual circuit subcomponents. However, you will
not find any interpretation of how they affect overall receiver performance, such as
the bit error rate (BER). This information is therefore of limited use to a modern-day
integrated circuit designer, who has the whole receiver to worry about and
must therefore interpret and relate the figures of merit of individual
circuit subblocks (given in circuit terminology) to the performance measure of the
whole receiver (given in communication terminology). Often, this performance
measure is the only information available to an integrated circuit designer. Hence
the following questions are likely: What would a poor receiver IIP3 do to the BER for
a standard using Gaussian minimum shift keying? Is modifying the modulation
scheme to reduce the required IIP3 of a receiver a reasonable option? Yet a classical
RF circuit text is unlikely to answer these questions. Therefore, it is the goal of this
and the next chapter to address these questions and to present the answers in a
language that most integrated circuit designers can understand.


1.2     Overview  of  Wireless  Systems 

In  this  subsection  we  review  a  basic  communication  system,  as  shown  in  Figure  1.1. 
Such  a  communication  system  consists  of  a  transmitter,  a  channel  (air  for  wireless 
communication,  cable  for  wireline  communication),  and  a  receiver.  In  a  typical 
digital  communication  system  the  modulator  in  the  transmitter  takes  the  digital 
bit  stream  from  the  source  and  modulates  it  on  a  high-frequency  carrier.  The  back 
end  then  conditions  the  modulated  signal  to  a  form  suitable  for  transmission. 
The  channel  takes  this  transmitted  signal  and  adds  distortion,  noise,  interference, 
and  other  impairments  to  it.  The  receiver  consists  of  a  front  end  that  takes  this  highly 
impaired  signal  and  conditions  it  to  a  form  that  the  demodulator  can  demodulate. 

Next  let  us  review  specifically  what  wireless  communication  (channel  is  air)  is  all 
about.  Mobile  wireless  systems  provide  users  with  the  opportunity  to  travel  freely 
within  the  service  area.  These  systems  are  made  possible  by  the  unique  property  of 
using  radio  waves  as  a  transmission/receiving  medium.  Radio  communication  is 
based  on  using  an  antenna  for  radiating  and  receiving  electromagnetic  waves.  When 
the  user's  radio  transceiver  is  stationary  over  a  prolonged  period  of  time,  the  term 
fixed  radio  is  used.  A  radio  transceiver  capable  of  being  carried  or  moved  around,  but 
stationary  during  transmission,  is  called  a  portable  radio.  A  radio  transceiver 
capable  of  being  carried  and  used,  by  vehicle  or  by  a  person  on  the  move,  is  called 
a  mobile  radio. 

A radio transceiver consists of both the receiver and the transmitter. In this book
we start with the receiver side, where a detailed treatment is given. We then carry the
discussion over to the transmitter, concentrating on components not already covered
in the receiver, such as the power amplifier.

Fig. 1.1    Block diagram of a typical communication system

To understand portable receiver design, we must understand the first relevant
property of a radio communication channel: while travelling, a user of a mobile
system must be aware of sudden changes in signal quality caused by one or more of
the following:

1.  movement relative to the corresponding base station

2.  the  surroundings 

3.  multipath  propagation 

The user must consider all these issues because, unlike wireline channels
such as fibre optics and cables, the radio communication channel is unprotected
against natural disturbances such as lightning, temperature variation, or humidity.
Mathematically, the airwaves environment belongs to a class of nonstationary
random fields for which the data transmission behavior is difficult to predict and
model. Therefore, information degradation due to the movement of a user relative
to reflection points and scatterers (buildings, trees, etc.), which causes Doppler
frequency shift or multipath fading, is troublesome to predict. Moreover, the
channel is not protected against radio transmission in other bands. Finally, as
with wireline communication systems, the signal's amplitude must be large enough
to overcome the additive white Gaussian noise (AWGN) of the channel. The receiver
must detect a signal whose amplitude fluctuates widely, buried
under noise and in the presence of large-amplitude interference. The overall
signal amplitude required to achieve an acceptable BER in the face of all of these
hurdles is a function of the modulation scheme. This situation, as will be shown,
leads to tight sensitivity requirements.

The  second  relevant  property  is  that  all  radio  systems  share  the  same  natural 
resource:  the  airwaves  (frequency  bands  and  space).  Filtering  protects  the 
channel  from  interference  from  other  radio  frequency  bands.  Furthermore,  since  the 
transmission  medium  is  shared,  we  must  ensure  separation  between  the  time  when 
the portable is transmitting and the time when it is receiving. This separation can
be done in either the time or the frequency domain and is called time division duplexing
(TDD) or frequency division duplexing (FDD), respectively. In the FDD case, proper
filtering must be done. In addition, since we have a shared transmission medium, we
must provide access for individual users, using methods of multiple access such
as time division multiple access (TDMA), frequency division multiple access
(FDMA), code division multiple access (CDMA), and orthogonal frequency
division multiple access (OFDMA). In the case of FDMA, proper filtering must
again be done to provide channel selection. Finally, we must filter out jamming
interference (i.e. human-made noise, adjacent channel interference, and co-channel
interference inherent to cellular systems). All of these issues affect the receiver's
selectivity requirements.

The  hostile  environment  and  interference  also  cause  intersymbol  interference 
(ISI),  and  the  fast  moving  mobile  nature  of  the  system  causes  phase  impairment. 
Mobile  radio  systems  employ  sophisticated  coding  and  equalization  techniques 
to  combat  ISI  and  diversity  techniques  to  combat  Doppler  effect.  Carrier  and 
timing recovery circuits that introduce low phase noise are also needed in the
receiver.

On the other hand, to understand portable transmitter design, we must be sensitive to
any impact such a design will have on the receiver. For example, since the received
channel is not protected against radio transmission in other bands, as mentioned
above, the transmitter should be designed so that the transmitted power does not spill
over into other received channels. At the same time, the transmitter power should be
strong enough, and the transmitted noise small enough, that in the worst case the received
power (attenuated by the channel, under fading) is large enough to overcome the received
noise (due to transmitted noise and channel noise).

Our goal for the rest of this chapter is to examine all the aforementioned issues
and determine the boundary conditions that such issues impose on the front
end of a receiver and the back end of a transmitter. Specific radio standards are used
as  examples.  For  the  receiver,  starting  from  an  ideal  environment  with  only  AWGN 
in  the  channel,  we  explain  how  to  achieve  an  acceptable  BER  as  a  function  of  the 
modulation  scheme.  Next,  the  channel  is  restricted  to  a  finite  bandwidth,  and  we 
show  how  this  causes  ISI.  We  discuss  pulse  shaping  techniques  to  combat  ISI.  Next, 
we  discuss  the  impact  of  the  environment  on  the  channel,  such  as  path  loss, 
multipath  fading,  and  Doppler  shift,  and  examine  how  the  environment  can  lead 
to  a  reduced  received  envelope  amplitude  and  increased  ISI  and  how  the  BER  will 
be  degraded.  Finally,  we  introduce  techniques  to  combat  these  impairments. 
Later on, in chapter 9, for the transmitter, using an example standard (with a given
modulation scheme), the specs on the tolerated transmission error are translated to the
required transmitted SNR.

The 3G/4G standards include LTE (Long Term Evolution), where initially
the bandwidth is 5 MHz and the spectrum band is at 2 GHz. Another 3G standard
in this group is the Super3G experimental system, which is aligned with the 3GPP
(Third Generation Partnership Project) LTE specification. Here the bandwidth
options are 5 MHz, 10 MHz, and 20 MHz. The downlink (DL, from base
station to terminal) uses OFDMA, while the uplink (UL, from terminal to base
station) uses SC-FDMA (single-carrier frequency division multiple access). In spite of
its name, SC-FDMA can be understood by starting with OFDMA: it has an
additional discrete Fourier transform (DFT) block in front of the OFDMA system,
which will be treated later. Duplexing is FDD. The modulation scheme for the DL is QPSK,
16QAM, or 64QAM, depending on the bandwidth; for the UL it is QPSK or 16QAM, again
depending on the bandwidth. It also uses multi-antenna (MIMO) diversity for the DL,
with options of 1 x 2, 2 x 2, and 4 x 4. The peak data rate is 250-300 Mbps for the DL and
40-50 Mbps for the UL. IMT-Advanced (International Mobile Telecommunications
Advanced) is a 4G standard. Here the target peak data rate is 100 Mbps. The bandwidth
is scalable, up to 100 MHz. Bandwidth for UL and DL is
asymmetric. It uses layered OFDMA. Higher order MIMO is also adopted. Even
though the 3G/4G standards appear quite complicated, the design of the VLSI
circuits implementing the transceiver will be illustrated in this book by using
simpler standards as examples. Specifically, the DECT (digital enhanced cordless
telecommunications) standard is used for illustrating the receiver design, while the
GSM (global system for mobile communications) standard is used for illustrating
the transmitter design.

Table 1.1    Summary of DECT requirements

Cell range                    50 m - 400 m
Frequency range (RF)          1880 - 1900 MHz
Carrier spacing               1.728 MHz
Channels/carrier              2 x 12
Duplex method                 TDD using two slots on the same RF carrier
Channelization                TDMA/FDMA
Speech coding                 32 kbit/s ADPCM
Modulation                    GMSK (BTb = 0.3)
Gross data rate (Rb)          1.152 Mbit/s
BER                           10^-3
Maximum transmitted power     250 mW (peak)

In this sub-section we cover the DECT standard, which can provide wireless
access for both indoor and outdoor environments, with cell radius ranging from 50 to
several hundred meters. Hence, it is suitable for the development of residential,
business, and public applications. Moreover, DECT is not limited to telephony (i.e.,
speech) but handles text and data as well.

The  DECT  system  operates  at  a  channel  bit  rate  of  1.152  Mbps.  FDMA  and 
TDMA  techniques  are  used  to  separate  the  different  users.  A  Gaussian  minimum 
shift  keying  (GMSK)  modulation  scheme  converts  the  incoming  data  onto  a  carrier 
wave.  There  are  10  carriers  with  a  channel  spacing  of  1.728  MHz  occupying  a  band 
that  spans  from  1.88  to  1.9  GHz.  Twelve  channels  are  time  division  multiplexed 
onto  each  carrier,  and  TDD  is  used  for  each  channel,  resulting  in  24  time  slots. 
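
The numbers quoted above can be cross-checked with a few lines of arithmetic. The sketch below is only a sanity check on the figures given in the text and in Table 1.1; the function and key names are illustrative, and the per-slot rate is a simple average of the gross bit rate over the 24 slots, not a figure taken from the standard.

```python
# Values quoted in the text and in Table 1.1.
BAND_START_MHZ = 1880.0
BAND_STOP_MHZ = 1900.0
CARRIER_SPACING_MHZ = 1.728
NUM_CARRIERS = 10
CHANNELS_PER_CARRIER = 12          # duplex channels multiplexed on one carrier
GROSS_BIT_RATE_BPS = 1.152e6


def dect_summary():
    """Derive a few secondary quantities from the quoted DECT parameters."""
    # TDD gives every duplex channel one slot per direction.
    time_slots_per_carrier = 2 * CHANNELS_PER_CARRIER
    return {
        "time_slots_per_carrier": time_slots_per_carrier,
        "band_mhz": BAND_STOP_MHZ - BAND_START_MHZ,
        "occupied_mhz": NUM_CARRIERS * CARRIER_SPACING_MHZ,
        "duplex_channels": NUM_CARRIERS * CHANNELS_PER_CARRIER,
        # Average share of the gross rate per slot (assumption: equal split,
        # framing and guard overhead not separated out).
        "gross_rate_per_slot_bps": GROSS_BIT_RATE_BPS / time_slots_per_carrier,
    }


if __name__ == "__main__":
    for name, value in dect_summary().items():
        print(f"{name}: {value}")
```

Under these assumptions the ten carriers occupy 17.28 MHz of the 20 MHz band, and each slot carries an average gross rate of 48 kbit/s, which leaves room above the 32 kbit/s ADPCM speech rate for overhead.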

Table 1.1 summarizes the DECT requirements that are important for our
discussion.

We mentioned different access methods above. We now go over some of the
features of these access methods [5, 6].

Fig. 1.2    TDMA: block diagram and timing diagram of multiplexer


1.4.1     TDMA 

TDMA is provided by interleaving bit streams from different channels into one
bit stream, as shown in the time division multiplex box of the transmitter part in
Figure 1.2, in this case for just two users. Each stream is continuously divided into
groups of bits, known as time slots, and these time slots are then interleaved, as shown in
the timing diagram of Figure 1.2. Each output slot occupies half the time of the input slot,
since the bit rate is twice as large. At the time division demultiplex box of the receiver
part in Figure 1.2, these bit streams are separated back into individual bit streams.
The collection of bits corresponding to precisely one time slot from each input stream
is known as a frame. To make demultiplexing possible, the boundaries of the time slots
must be known, i.e. the beginning of the frame. Additional bits for this, known as framing bits, are
added. The framing bits, sometimes referred to as the reference burst, enforce a measure
of synchronization. Guard time is also included to account for the different phase in
each time slot. Finally, a preamble at the beginning of each traffic burst is added
to allow the receiver to acquire timing and carrier phase, since each traffic burst is
transmitted with an uncertain phase relative to the reference burst.
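
The interleaving and separation steps described above can be sketched in a few lines. This is a minimal illustration, assuming equal-length input streams and an ideal, already synchronized receiver; framing bits, guard time, and the preamble are deliberately left out, and the function names are hypothetical.

```python
def tdm_multiplex(streams, slot_len):
    """Interleave equal-length bit streams, one time slot per user per frame."""
    n_bits = len(streams[0])
    if any(len(s) != n_bits for s in streams) or n_bits % slot_len:
        raise ValueError("streams must have equal length, a multiple of slot_len")
    muxed = []
    for start in range(0, n_bits, slot_len):
        for stream in streams:                 # one slot from each user
            muxed.extend(stream[start:start + slot_len])
    return muxed


def tdm_demultiplex(muxed, num_users, slot_len):
    """Separate the multiplexed stream back into per-user bit streams."""
    frame_len = num_users * slot_len
    if len(muxed) % frame_len:
        raise ValueError("stream length must be a whole number of frames")
    streams = [[] for _ in range(num_users)]
    for frame_start in range(0, len(muxed), frame_len):
        for user in range(num_users):
            slot_start = frame_start + user * slot_len
            streams[user].extend(muxed[slot_start:slot_start + slot_len])
    return streams


if __name__ == "__main__":
    user_a = [1, 1, 0, 0]
    user_b = [0, 1, 0, 1]
    line = tdm_multiplex([user_a, user_b], slot_len=2)
    print(line)
    print(tdm_demultiplex(line, num_users=2, slot_len=2))
```

With two users the multiplexed stream has twice as many bits per unit time as either input, which is why each output slot lasts half as long as the corresponding input slot.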


1.4.2     FDMA

FDMA has been used in analog modulation, e.g. in AM transmission. In wireless
communication, transmission is typically digital, which will be covered in the next
section. Here frequency division can be used, where independent data streams are
transmitted in non-overlapping frequency bands. FDMA is very simple, as illustrated
in Figure 1.3, in this case again for just two users. The two transmitters have output
power spectra in two non-overlapping bands, where they usually use passband
PAM (pulse amplitude modulation, covered in the modulation section). To ensure that
the bands do not overlap,
it is common to put bandpass filters at the output of each channel. At the receiver, similar
bandpass filters eliminate all but the desired data signals. The path to the desired
received signal from the undesired transmitted signal contains two bandpass filters
with non-overlapping passbands, and therefore we can make the loss of the crosstalk
path as large as we design for. One consideration is the nonlinearity of the amplifiers in the
transmitter. These nonlinearities create out-of-band energy that can interfere with
other FDMA channels. It is possible to use bandpass filtering after the amplifier to
reduce these spectral components. In comparison, for FDMA, filters can be expensive
to implement and linearity is definitely an issue. On the other hand, for TDMA,
propagation delay and crosstalk are problems.

Fig. 1.3    FDMA: block diagram and frequency response of bandpass filters


1.4.3     CDMA

Fig. 1.4    (a) CDMA (b) partial correlation in CDMA: channel responses $h_1$ and $h_2$ over waveform
duration T, which differ in delay by $\tau$, causing partial correlation

Code division multiple access (CDMA) allows multiple users to transmit signals in
the same frequency band at the same time. Compared to TDMA and FDMA, the
explanation of CDMA uses the concept of signal space, which can be more abstract
and mathematical. Signal space borrows the idea of a space from
linear algebra, where a simple example is a vector space (in an
M-dimensional vector space each element is represented by a row with M entries).
In our day-to-day experience of a 3-dimensional physical world, the corresponding
vector space is a 3-dimensional vector space, with each element being a 3-dimensional
arrow. The explanation of CDMA also exploits the idea of a subspace, which is a kind
of subset obtained from the original space, but with reduced dimension. In
the example of the 3-dimensional physical world, an
example of a subspace is a 2-dimensional vector space, with each element being
a 2-dimensional arrow lying in a plane. Now let us start our explanation. In signal
space, suppose the available bandwidth is B Hz, and assume that it is to be
shared by N users. The waveforms of duration T seconds and bandwidth B form
a subspace whose dimension is approximately 2BT. Therefore, with N = 2BT, there is
enough dimension that the
N different users' waveforms can stay orthogonal (roughly speaking, this means that the
separation is "perfect", i.e. the waveforms are not "correlated"). Specifically, this means that if the
received waveforms are decoded using matched filters (discussed below), each filter
selects only the desired signal while rejecting the remaining signals (because they
are orthogonal). This orthogonality can be achieved in one of the following ways:

•  the time period is divided into 2BT time slots; one time slot is assigned to each user.
This is essentially the TDMA concept discussed above.

•  the frequency band is divided into 2BT frequency bands; one band is assigned to
each user. This is the FDMA concept discussed above.

•  2BT orthogonal waveforms, $h_1(t), h_2(t), \ldots, h_{2BT}(t)$, are selected; each waveform
occupies bandwidth B and time T. Each waveform is then assigned
to a separate user for transmission. We further use one period of a shift
register sequence as the transmission pulses. The sequences are designed so that
they are orthogonal, and each is viewed as the "code" of its user. At the
receiver, if the same sequence is used to correlate with the incoming signal, orthogonality
ensures that only the user with the proper sequence/code will recover the
transmitted signal; the others will be rejected because the cross-correlation
product is zero. This is CDMA and is shown in Figure 1.4a.
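
The third option in the list above can be illustrated with a small sketch. Note the substitution: instead of the shift register sequences mentioned in the text, it uses Walsh-Hadamard codes, which are exactly orthogonal and easy to generate. It also assumes perfectly synchronized users and an ideal, noiseless channel. All function names are illustrative.

```python
def walsh_codes(order):
    """Return 2**order mutually orthogonal +/-1 Walsh-Hadamard code rows."""
    rows = [[1]]
    for _ in range(order):
        rows = [r + r for r in rows] + [r + [-c for c in r] for r in rows]
    return rows


def spread(bits, code):
    """Map bits {0, 1} to symbols {-1, +1} and multiply each by the user's code."""
    chips = []
    for bit in bits:
        symbol = 1 if bit else -1
        chips.extend(symbol * c for c in code)
    return chips


def despread(chips, code):
    """Correlate each code-length block with the code and threshold at zero."""
    n = len(code)
    bits = []
    for start in range(0, len(chips), n):
        corr = sum(x * c for x, c in zip(chips[start:start + n], code))
        bits.append(1 if corr > 0 else 0)
    return bits


if __name__ == "__main__":
    codes = walsh_codes(2)                 # four codes of length four
    bits_a, bits_b = [1, 0, 1], [0, 0, 1]
    # Both users transmit at the same time in the same band: signals add.
    channel = [a + b for a, b in zip(spread(bits_a, codes[1]),
                                     spread(bits_b, codes[2]))]
    print(despread(channel, codes[1]), despread(channel, codes[2]))
```

Because the two codes have zero cross-correlation, each correlator sees only its own user's contribution, even though the transmitted chips overlap completely in time and frequency.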

In comparison, CDMA has a near-far problem, which is not as severe in
TDMA and FDMA. The near-far problem occurs when the interfering signals are much
stronger than the desired signal. It occurs when, for example, neighboring
transmitters (which interfere) are closer and hence have a much higher power level
than the intended transmitter, which happens to be far away. In TDMA,
only one transmitter is transmitting at a time (so this is not a problem), and in
FDMA, the bandpass filter stopband attenuation can be large (so the problem is greatly reduced).
In CDMA, however, orthogonality is only approximate (the correlation operation is not perfect,
resulting in the partial correlation problem). Recognizing that decoding by matched
filtering (discussed below) happens over the time duration T, inaccurate duration
timing results in partial correlation. Inaccurate duration timing is due to, among
other factors, inaccurate synchronization between the transmitting channels, as well
as propagation delays. The phase relationship between the duration timing of different
transmitters can be arbitrary. Consider Figure 1.4b, which shows two channels,
channel 1 and channel 2, with channel responses $h_1$ and $h_2$ that differ in delay
by $\tau$. Since they can interfere with each other, the correlations between them (partial
correlations) should be zero to maintain orthogonality, i.e.

$$\int_0^{T-\tau} h_1(t)\,h_2(t+\tau)\,dt = 0 \quad\text{and}\quad \int_{T-\tau}^{T} h_1(t)\,h_2(t+\tau-T)\,dt = 0.$$

In reality, since $\tau$ varies, these two partial
correlations are not identically zero. The goal should thus be to keep them small
for any value of $\tau$. Since the variability in $\tau$ increases with the number of
users, achieving this goal means restricting the number of users. This number is
typically smaller than in the TDMA and FDMA cases.
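
A discrete-time version of the two partial correlations makes the problem concrete. The sketch below assumes chip-spaced +/-1 code sequences in place of the continuous responses $h_1(t)$ and $h_2(t)$, and an integer delay measured in chips. The two length-4 Walsh codes in the example are an assumption chosen for illustration.

```python
def partial_correlations(c1, c2, delay):
    """Return the two partial correlations of c1 with c2 delayed by `delay` chips.

    Discrete analogue of the integrals over [0, T - tau] and [T - tau, T].
    """
    n = len(c1)
    if len(c2) != n or not 0 <= delay < n:
        raise ValueError("codes must have equal length and 0 <= delay < length")
    first = sum(c1[i] * c2[i + delay] for i in range(n - delay))
    second = sum(c1[i] * c2[i + delay - n] for i in range(n - delay, n))
    return first, second


if __name__ == "__main__":
    code_1 = [1, 1, -1, -1]
    code_2 = [1, -1, -1, 1]
    for delay in range(4):
        print(delay, partial_correlations(code_1, code_2, delay))
```

With zero delay the two codes are orthogonal, but a one-chip timing offset already produces large partial correlations. This is why codes intended for unsynchronized users are designed to keep these values small for every delay, not just at perfect alignment.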


1.4.4     OFDMA 

Another approach to allow multiple access is OFDM. Similar to CDMA, OFDM
achieves this by exploiting the orthogonality property. Unlike CDMA, where the
orthogonality is between sequences ("codes") assigned to each transmission waveform,
here orthogonality is between the carriers. Hence OFDM consists of
multicarrier transmission. Specifically, a single high-rate data stream is
transmitted over numerous lower-rate subcarriers (SC). In that sense, in addition
to being an access approach, it can also be seen as a modulation technique
(modulation techniques are discussed in the next section).

In a straightforward implementation of multicarrier transmission, the signal
frequency band is divided into N disjoint frequency subchannels, each modulated
with a separate symbol. The subchannels are then multiplexed in the frequency
domain. Because the subchannels are nonoverlapping, interchannel interference is
eliminated. The disadvantage is, obviously, an inefficient use of the
spectrum. To overcome this, overlapping subchannels can be used instead. Such
an overlapping multicarrier technique is shown at the bottom of Figure 1.5. Here
there are eight subcarriers. For comparison, the conventional multicarrier approach
is shown on top, where there is no overlap. It can be seen that the saving in
bandwidth is close to 50%. To realize this technique, however, we need to reduce
crosstalk between SCs, which means orthogonality is required, and hence the technique
is called OFDM. This dictates that the separate frequencies follow a
mathematical relationship. In contrast, in a typical FDMA system, no such relationship
is required. Also shown at the bottom of Figure 1.5 are (a) an example channel
response, with coherence bandwidth $(\Delta f)_c$, and (b) the subcarriers subjected to fades. As seen,
only some of the subcarriers (the middle 4) are subjected to fading under this example
channel, and so OFDM is more robust against fading. This will be explained in more
detail later, in section 1.10, after the concepts of fading and coherence bandwidth are
introduced.

Fig. 1.5    OFDM combat fading (panels: conventional multicarrier; OFDM, with the bandwidth
savings and an example channel response of coherence bandwidth $(\Delta f)_c$ marked)

Fig. 1.6    OFDM multi user access (panel title: OFDMA - Multi User Access)

Next we turn to Figure 1.6, which shows how OFDM can be used for
access. Let us assume there are 4 users: A, B, C, D. Users A and B are each allocated
2 bands, while users C and D are each allocated 2 bands. Notice, though, that the bands for
each user are not adjacent (unlike in the FDMA case). This means that if fading occurs
(and the coherence bandwidth is small), only one of a user's bands is affected (e.g. a deep
fade happens at the first band), while the 2nd and/or 3rd bands are not affected. In this
sense, redundancy is built in. In addition, because a variable number of
channels can be assigned to different users, the bandwidth allocated to a
particular user is flexible and can be changed dynamically.
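
The "mathematical relationship" between subcarrier frequencies is not spelled out at this point in the text. The usual choice, assumed here, is to space the subcarriers by integer multiples of 1/T, where T is the symbol duration. The sketch below checks this numerically using sampled complex exponentials. The function name and the sample count are illustrative.

```python
import cmath


def subcarrier_inner_product(k, m, n_samples):
    """Normalized inner product of subcarriers k and m over one symbol.

    Subcarrier k is exp(j*2*pi*k*n/n_samples), i.e. k cycles per symbol.
    """
    total = 0j
    for n in range(n_samples):
        a = cmath.exp(2j * cmath.pi * k * n / n_samples)
        b = cmath.exp(2j * cmath.pi * m * n / n_samples)
        total += a * b.conjugate()
    return total / n_samples


if __name__ == "__main__":
    print(abs(subcarrier_inner_product(3, 3, 64)))     # same subcarrier
    print(abs(subcarrier_inner_product(3, 4, 64)))     # spacing 1/T
    print(abs(subcarrier_inner_product(3, 3.5, 64)))   # spacing 1/(2T)
```

Subcarriers one full cycle apart per symbol have zero inner product even though their spectra overlap, while a half-cycle spacing leaves substantial crosstalk. This is the sense in which the carriers of a plain FDMA system, which need no such relationship, differ from those of OFDM.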


1.5     Modulation  Schemes 

There are many modulation schemes for transmitting digital data from the transmitter
to the receiver in wireless communication. We focus on the modulation schemes that
will lead us to MSK, as it is commonly used in most wireless standards (including
DECT). A more comprehensive treatment of modulation schemes can be found in [1].

The choice of modulation scheme influences performance measures such as BER, SNR,
and message bandwidth. This can affect the receiver design at both a high and a low
level. At a high level, modulation schemes can be chosen that lead to a
reduced message bandwidth and hence higher channel capacity. However, such
design activities are usually of interest only to communication system engineers and
will not be our main concern. On the other hand, modulation schemes can affect the
receiver design at a low level as well. For example, modulation schemes with good
BER performance increase the receiver's resistance to noise in the channel, such as
AWGN, and to destructive interference coming from multipath fading.
Consequently, these better modulation schemes make the receiver design easier by
lowering the required SNR (reducing the sensitivity requirement). We call the choice
of such schemes a low-level receiver design activity. These design activities are of significant
interest to integrated circuit designers and will be our main focus.

For any modulator, demodulation can be done in either a coherent or a noncoherent
fashion. For coherent demodulation we assume that the receiver has exact knowledge
of the carrier wave's phase reference, in which case we say that the receiver is phase
locked to the transmitter. In noncoherent demodulation, knowledge of the carrier
wave's phase is not required. The complexity of the receiver is thereby reduced,
although at the expense of inferior error performance. In the following we concentrate
mainly on coherent demodulation.


1.5.1     Binary  Frequency  Shift  Keying 

With binary frequency shift keying (BFSK), the modulation happens on the frequency
of the carrier. As the binary input signal changes from a logic 0 to a logic 1, and vice
versa, the FSK output signal shifts between two frequencies, a logic 1 frequency ($f_1$)
and a logic 0 frequency ($f_0$). A possible implementation of an FSK modulator is shown
in Figure 1.7.

Fig. 1.7    Block diagram of a binary FSK modulator

Fig. 1.8    Block diagram of a binary FSK coherent demodulator

Figure 1.8 shows a binary FSK coherent demodulator. A BFSK demodulator
works by correlating the input with the function $\phi(t) = A\cos\omega_0 t - A\cos\omega_1 t$.
Mathematically, this correlation is achieved by multiplying the received signal
by the correlation function $\phi(t)$ and then integrating over one bit duration $T_b$, as
described in (1.1) and (1.2). Let us assume for the time being that the channel is ideal,
so that the demodulator input is the same as the modulator output $S_{BFSK}(t)$.
Hence the correlator in Figure 1.8 can have two possible outputs, depending on
whether a logic 0 or a logic 1 was transmitted:

$$\text{logic 0}: \quad s_0(t) = \int_0^{T_b} (A\cos\omega_0 t)(A\cos\omega_0 t - A\cos\omega_1 t)\,dt = \frac{A^2 T_b}{2} \tag{1.1}$$

$$\text{logic 1}: \quad s_1(t) = \int_0^{T_b} (A\cos\omega_1 t)(A\cos\omega_0 t - A\cos\omega_1 t)\,dt = -\frac{A^2 T_b}{2} \tag{1.2}$$


Note that the expressions (1.1) and (1.2) are exact only if $\cos\omega_0 t$ and $\cos\omega_1 t$ are
orthogonal over the bit duration, and therefore their frequencies have to satisfy the relations
$f_0 = n_0/T_b$ and $f_1 = n_1/T_b$, where $n_0$ and $n_1$ are distinct integers.

Fig. 1.9    Block diagram of a binary PSK modulator

Fig. 1.10   Block diagram of a binary PSK coherent demodulator


The  correlator  output  is  then  compared  with  a  threshold,  set  to  be  0.  Thus,  a 
simple  threshold  detector  can  determine  which  message  was  actually  transmitted, 
yielding  the  baseband  binary  data. 
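
The correlate-and-threshold operation can be checked numerically. The following is a minimal sketch, not the circuit of Figure 1.8: the function names, the midpoint-rule integration, and the values of `A`, `Tb`, `n0`, and `n1` are assumptions chosen only for illustration, with the two tones placed at integer multiples of $1/T_b$ so that they are orthogonal.

```python
import math

def bfsk_correlator(bit, A=1.0, Tb=1e-3, n0=4, n1=5, steps=20000):
    """Correlate a noiseless BFSK symbol with phi(t) = A cos(w0 t) - A cos(w1 t).

    n0 and n1 are illustrative integers giving f0 = n0/Tb and f1 = n1/Tb.
    """
    w0 = 2.0 * math.pi * n0 / Tb      # logic 0 frequency
    w1 = 2.0 * math.pi * n1 / Tb      # logic 1 frequency
    w_tx = w1 if bit else w0          # tone actually transmitted
    dt = Tb / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt            # midpoint rule over one bit period
        s = A * math.cos(w_tx * t)
        phi = A * math.cos(w0 * t) - A * math.cos(w1 * t)
        total += s * phi * dt
    return total

def decide(x):
    """Threshold detector at 0: positive output means logic 0, negative means logic 1."""
    return 0 if x > 0 else 1

if __name__ == "__main__":
    for bit in (0, 1):
        x = bfsk_correlator(bit)
        print(bit, x, decide(x))
```

With these assumed values the correlator output is $+A^2T_b/2$ for a logic 0 and $-A^2T_b/2$ for a logic 1, so the threshold at 0 separates the two cases.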


1.5.2    Binary  Phase  Shift  Keying 

With binary phase shift keying (BPSK) the modulation happens on the phase of the carrier. Two output phases are possible for a single carrier frequency. One output phase represents a logic 0 and the other a logic 1. As the input digital signal changes, the phase of the output carrier shifts between two angles that are 180° out of phase. Therefore, the two signals, $s_0(t)$ and $s_1(t)$, that are used to represent binary symbols 0 and 1 are defined by $s_0(t) = A\cos\omega_c t$ and $s_1(t) = A\cos(\omega_c t + \pi) = -A\cos\omega_c t$, respectively. The implementation of a binary PSK modulator is shown in Figure 1.9.

A binary PSK coherent demodulator is shown in Figure 1.10. Again we assume for the time being that the channel is ideal, so that the modulator output $s_{BPSK}(t)$ is the same as the demodulator input. Since $s_0(t) = -s_1(t)$ (referred to as antipodal signals), the correlating signal in the detector is simply $\phi(t) = A\cos\omega_0 t$, where $\omega_0 = 2\pi f_0$ denotes the carrier frequency (written $\omega_c$ above). Furthermore, we fix the carrier frequency $f_0$ to be equal to $n_0/T_b$ for some integer $n_0$ in order to provide a proper correlator output. We then have two possible correlator outputs, depending on whether a logic 0 or a logic 1 was actually transmitted.


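$$\text{logic 1, } s_1(t) \text{ transmitted:}\quad \int_0^{T_b} (-A\cos\omega_0 t)(A\cos\omega_0 t)\,dt = -\frac{A^2 T_b}{2} \qquad (1.3)$$

$$\text{logic 0, } s_0(t) \text{ transmitted:}\quad \int_0^{T_b} (A\cos\omega_0 t)(A\cos\omega_0 t)\,dt = \frac{A^2 T_b}{2} \qquad (1.4)$$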


Then  the  integrator  output  is  compared  to  a  threshold  of  0,  yielding  the  baseband 
binary  data. 


1.5.3     Quadrature  Phase-Shift  Keying 

Quadrature phase shift keying (QPSK) is another form of phase-modulated, constant-envelope digital modulation. In contrast to BPSK and BFSK, QPSK has more than two representations for the input. These multirepresentation signals are called M-ary signals, where M stands for the possible number of representations of the signal. QPSK is an M-ary encoding technique where M = 4, since a QPSK output signal has four possible output phases. Note that in general we talk about M-ary signaling when the following relation is satisfied: N = log2 M, where N is the number of bits at the input of the modulator and M is the number of output conditions possible with N bits. Accordingly, with QPSK modulation four output phases are possible for a single carrier frequency. The four different output phases must be characterized by four different input conditions. Since the digital input of a QPSK modulator is a binary signal, it takes more than a single input bit to produce four different input conditions. With 2 bits (N = 2), there are four possible conditions: 00, 01, 10, and 11. Therefore, with QPSK, the binary input data are combined into groups of 2 bits (also called dibits). Each dibit code generates one of four possible output phases.
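
The relation N = log2 M and the grouping of a bit stream into dibits can be illustrated with a short sketch. The helper names `bits_per_symbol` and `group_bits` are hypothetical and introduced only for this example.

```python
import math

def bits_per_symbol(M):
    """Return N = log2(M) for an M-ary scheme; M must be a power of two."""
    n = int(math.log2(M))
    if 2 ** n != M:
        raise ValueError("M must be a power of two")
    return n

def group_bits(bits, M=4):
    """Group a bit list into N-bit tuples (dibits when M = 4)."""
    n = bits_per_symbol(M)
    if len(bits) % n:
        raise ValueError("bit count must be a multiple of N")
    return [tuple(bits[i:i + n]) for i in range(0, len(bits), n)]

if __name__ == "__main__":
    print(bits_per_symbol(4))                      # N for QPSK
    print(group_bits([0, 0, 0, 1, 1, 0, 1, 1]))    # the four possible dibits
```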

A possible implementation of a QPSK modulator is shown in Figure 1.11. We need to add a demultiplexer, which generates a dibit sequence from a binary input data stream. The task of the demultiplexer is to separate the binary bit stream into an "upper arm" binary sequence and a "lower arm" binary sequence. The demultiplexer separates the input sequence in such a way that 1 bit goes into the upper arm (I-channel) and the following bit goes into the lower arm (Q-channel). The bit rate in each channel is therefore half of the bit rate of the input data into the bit splitter. The binary sequence in each channel now modulates a carrier wave in the same manner as a BPSK modulator. An additional device has to make sure that the carrier signal of the I-channel ($\sin\omega_c t$) lies 90° out of phase with respect to that of the Q-channel ($\cos\omega_c t$). Finally, the two corresponding BPSK signals are added together by a linear summer to form the QPSK output signal. This output signal consists of an in-phase component $s_I(t)$ and a quadrature component $s_Q(t)$ and can be expressed as $s_{QPSK}(t) = \pm A\cos\omega_c t \pm A\sin\omega_c t$. Note that the output signal only changes after two consecutive bits are clocked in. Such a signal produces four possible message points in a two-dimensional signal space (M = 4).

Fig. 1.11  Block diagram of a QPSK coherent modulator

A QPSK demodulator is shown in Figure 1.12. The QPSK demodulator consists of a pair of correlators with a common input and a corresponding pair of coherent reference signals $\phi_1(t) = \cos\omega_c t$ and $\phi_2(t) = \sin\omega_c t$. As before, we assume that the channel is ideal, so that the modulator output $s_{QPSK}(t)$ is the same as the demodulator input. Notice that the in-phase component $s_I(t)$ and the quadrature component $s_Q(t)$ of the received signal can be detected independently in an upper arm and in a lower arm, since $s_I(t)\cos\omega_c t$ and $s_Q(t)\sin\omega_c t$ are orthogonal. Therefore, each arm produces a correlator output in the same way as the BPSK receiver. The integrator outputs, $X_I$ and $X_Q$, are each compared in a threshold detector with a threshold of 0. For the I-channel, if $X_I > 0$ a decision is made in favor of logic 1, and if $X_I < 0$ a decision is made in favor of logic 0. The Q-channel (with integrator output $X_Q$) works in the same way, and the result is independent of $X_I$ (in-phase signal). Finally, the two output binary sequences from the I-channel and the Q-channel are combined in a multiplexer to reproduce the actually transmitted message.
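
The independence of the two arms can be sketched numerically. The code below is an illustrative model only, not the circuits of Figures 1.11 and 1.12. It assumes that a logic 1 maps to +A and a logic 0 to -A, that the I arm uses the cosine carrier and the Q arm the sine carrier, and that the carrier completes an integer number `nc` of cycles per symbol; all names and values are assumptions.

```python
import math

def qpsk_demod_symbol(i_bit, q_bit, A=1.0, Ts=1e-3, nc=4, steps=4000):
    """Modulate one dibit onto cos/sin carriers, then correlate each arm."""
    wc = 2.0 * math.pi * nc / Ts
    a_i = A if i_bit else -A
    a_q = A if q_bit else -A
    dt = Ts / steps
    x_i = 0.0
    x_q = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        s = a_i * math.cos(wc * t) + a_q * math.sin(wc * t)   # QPSK symbol
        x_i += s * math.cos(wc * t) * dt                      # upper-arm correlator
        x_q += s * math.sin(wc * t) * dt                      # lower-arm correlator
    # Threshold detectors at 0, one per arm.
    return (1 if x_i > 0 else 0, 1 if x_q > 0 else 0)

def qpsk_link(bits):
    """Demultiplex bits into dibits, pass each through the link, and remultiplex."""
    out = []
    for i in range(0, len(bits), 2):
        out.extend(qpsk_demod_symbol(bits[i], bits[i + 1]))
    return out

if __name__ == "__main__":
    print(qpsk_link([0, 0, 0, 1, 1, 0, 1, 1]))
```

Because the cross term between the cosine and sine carriers integrates to zero over the symbol, each arm recovers its own bit regardless of the other.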

A few comments are now in order for QPSK. First, we can look at QPSK as an extension of BPSK and BFSK. Second, we can also look at it as the first example of a broad class of modulation, called quadrature modulation, which is characterized by subdividing the binary bit stream into pairs of bits (dibits), where each dibit is mapped onto one of four levels before modulation. Quadrature modulation is broadly divided into two classes: quadrature phase shift keying and its variants, and minimum shift keying (MSK) and its variants. Later, when we see that the channel is no longer ideal but is corrupted by noise and has a finite bandwidth, these different modulation schemes can be interpreted as different ways of trading off BER, message bandwidth occupied, and ISI performance. In the next section we will cover some variants of QPSK, like offset QPSK (OQPSK). The discussion on OQPSK will then carry us over into the next category of quadrature modulation, namely minimum shift keying (MSK).

Fig. 1.13  Block diagram of an OQPSK modulator


1.5.4    Offset  Quadrature  Phase  Shift  Keying 

Let us assume the channel is no longer ideal but has a finite bandwidth. Since QPSK has large phase changes at the end of each symbol, the bandwidth of the symbol is rather large. If the symbol bandwidth becomes comparable to the channel bandwidth, the transmitted symbol will be distorted by the channel. To mitigate this effect, a variant called offset QPSK (OQPSK) is introduced, whose transmitter is shown in Figure 1.13. Notice that, when compared with Figure 1.11, a time delay $T_b$ is introduced in the Q-path so that the I and Q paths are offset in time by half the symbol period. This avoids simultaneous transitions in the waveforms at nodes A and B. Instead of having a phase step of 180°, the phase step is now only 90°.
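
The reduction of the maximum phase step can be verified with a small sketch that tracks the carrier phase on a half-symbol grid. The function name and the ±1 level sequences are assumptions chosen for illustration; the delay of half a symbol on the Q path follows the description above.

```python
import math

def max_phase_step(i_levels, q_levels, offset):
    """Largest phase jump (degrees) between consecutive half-symbol intervals.

    i_levels, q_levels: per-symbol levels (+1 or -1) on the I and Q paths.
    offset: if True, delay the Q path by half a symbol (OQPSK).
    """
    # Hold each symbol for two half-symbol intervals.
    i_half = [v for v in i_levels for _ in range(2)]
    q_half = [v for v in q_levels for _ in range(2)]
    if offset:
        q_half = [q_half[0]] + q_half[:-1]   # half-symbol delay on Q
    phases = [math.degrees(math.atan2(q, i)) for i, q in zip(i_half, q_half)]
    largest = 0.0
    for a, b in zip(phases, phases[1:]):
        d = abs(b - a) % 360.0
        largest = max(largest, min(d, 360.0 - d))
    return largest

if __name__ == "__main__":
    i_seq = [1, -1, 1, -1]
    q_seq = [1, -1, -1, 1]
    print(max_phase_step(i_seq, q_seq, offset=False))  # plain QPSK
    print(max_phase_step(i_seq, q_seq, offset=True))   # OQPSK
```

For the sequences shown, plain QPSK contains a transition in which both levels flip at once (a 180° step), whereas with the offset only one path changes at any instant.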


1.5.5    Minimum  Shift  Keying 


Another method to avoid large phase changes at the end of each symbol is to adopt a modulation scheme that has continuous phase shift. One such modulation scheme is MSK. This new modulation scheme can be derived from OQPSK by applying half-sinusoids, instead of rectangular pulses, to represent the levels that are multiplied by the carriers. The resulting modulator is shown in Figure 1.14. Now $\omega_1$ is selected such that $\omega_1 = \pi/(2T_b)$. By doing so, $s_{MSK}(t)$ can be shown to exhibit no abrupt change in phase. Thus, there is also no abrupt change in the slope of $s_{MSK}(t)$. From a frequency domain point of view, this means MSK exhibits a sharper decay in its spectrum than QPSK, or a lower side-lobe signal power. Hence for a channel with a finite bandwidth, less distortion is introduced.

Fig. 1.14  Block diagram of an MSK modulator
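
The effect of the half-sinusoid weighting can be checked on the baseband I and Q waveforms. The sketch below is an assumed model rather than the modulator of Figure 1.14: the I level is weighted by $\cos(\omega_1 t)$ and the Q level by $\sin(\omega_1 t)$ with $\omega_1 = \pi/(2T_b)$, and each level is allowed to change only where its weighting function crosses zero. The function name and sampling density are illustrative.

```python
import math

def msk_baseband_check(i_levels, q_levels, Tb=1.0, samples_per_bit=200):
    """Return (largest phase step in degrees, largest envelope error).

    i_levels[k] applies on [(2k-1)Tb, (2k+1)Tb); q_levels[k] on [2kTb, (2k+2)Tb).
    """
    n = min(len(i_levels), len(q_levels))
    dt = Tb / samples_per_bit
    total = (2 * n - 1) * samples_per_bit      # stay inside the defined symbols
    w1 = math.pi / (2.0 * Tb)
    prev_phase = None
    max_step = 0.0
    max_env_err = 0.0
    for k in range(total):
        t = (k + 0.5) * dt
        a_i = i_levels[int((t + Tb) // (2.0 * Tb))]
        a_q = q_levels[int(t // (2.0 * Tb))]
        i_t = a_i * math.cos(w1 * t)           # half-sinusoid weighted I level
        q_t = a_q * math.sin(w1 * t)           # half-sinusoid weighted Q level
        max_env_err = max(max_env_err, abs(math.hypot(i_t, q_t) - 1.0))
        phase = math.degrees(math.atan2(q_t, i_t))
        if prev_phase is not None:
            d = abs(phase - prev_phase) % 360.0
            max_step = max(max_step, min(d, 360.0 - d))
        prev_phase = phase
    return max_step, max_env_err

if __name__ == "__main__":
    print(msk_baseband_check([1, -1, 1, -1], [1, -1, -1, 1]))
```

In this model the envelope stays constant and the phase moves by only a fraction of a degree between adjacent samples, even across level changes, which is the continuous-phase property described above.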


1.5.6    MSK:  Another  Viewpoint 

We can also view MSK as a form of frequency shift keying, except this time it is a continuous-phase frequency shift keying (CPFSK). Essentially, MSK is binary FSK except that the logic 0 and the logic 1 frequencies are synchronized with the binary input bit rate. Synchronous simply means that there is a precise timing relationship between the two; it does not mean that they are equal. With MSK, the logic 0 and logic 1 frequencies are selected such that they are separated from the fundamental frequency by an exact odd multiple of one-half of the bit rate. In other words, $f_0$ and $f_1$ equal $nR_b/2$, where $R_b$ is the input bit rate and $n$ is any odd integer. This ensures that there is a smooth phase transition in the modulated signal when it changes from $f_0$ to $f_1$, and vice versa. Each transition occurs at a zero crossing and there are no phase discontinuities. However, the disadvantage of MSK is that it requires synchronizing circuits, and the complexity of the implementation increases considerably.


1.6.1    Additive White Gaussian Noise


Let us refer again to Figure 1.1. In Section 1.5 we described the various modulation schemes as if the channel were ideal, except for the few cases towards the end, when we introduced the concept of a channel having finite bandwidth. In real life, the channel also has noise in it. What happens? For a given signal amplitude A, if the noise becomes large enough, we are going to make errors. The resulting error rate must not exceed the limit dictated by a given standard, usually specified as a BER. Here we derive the BER as a function of the SNR for a given modulation/demodulation scheme, assuming the channel is corrupted with AWGN only. We will make the derivation first for the simple modulation scheme, BFSK. We will then extend and state the result for MSK.

Fig. 1.15  Block diagram of a communication system using BFSK and having arbitrary pulse shape p(t)

Let us redraw Figure 1.1 in Figure 1.15 with the following modification: the back end and front end have all been removed. Hence the transmitter is equivalent to the modulator and the receiver is equivalent to the demodulator. The channel is ideal except that there is AWGN. This is represented by having an ideal channel with noise $n(t)$ injected into it. The modulator/demodulator uses one of the modulation schemes described previously. Hence we use a general representation for the correlation signal, $p(t)$. For example, for BFSK we substitute $p_0(t) = A\cos\omega_0 t$ and $p_1(t) = A\cos\omega_1 t$. The demodulator part in Figure 1.15 would then agree with Figure 1.8, the demodulator for BFSK.

First, let us consider the signal $x(t)$ at $t = T_b$: it can have one of two values, $A$ or $-A$, with the decision threshold set at 0. Its probability density function (pdf) is shown in Figure 1.16(a). If there is no noise in the channel, the received signal $x(T_b)$ will also have the same pdf. Now let us consider the AWGN $n(t)$. Since it is Gaussian noise, the pdf of $n(T_b)$ is as shown in Figure 1.16(b), with standard deviation $\sigma_n$. With noise in the channel, $x(T_b)$ becomes $A + n(T_b)$ or $-A + n(T_b)$. Since $n(t)$ is another random variable that is independent of the symbol $x(T_b)$, when they add, their probability density functions become convolved. The pdf of $x(T_b)$ now changes to that shown in Figure 1.16(c). Notice that if the noise variance is large enough, as is the case here, then the two pdfs that correspond to logic 0 and logic 1 overlap significantly. This means an error occurs when $A + n(T_b)$ becomes less than zero, or when $-A + n(T_b)$ is larger than zero. Now the probability of having logic 0 ($-A$) is the same as that of having logic 1 ($A$) and equal to 1/2. Hence the probability of making an error when sending logic 0 ($-A$) is just the product of 1/2 and the probability of making an error conditional upon this event. The probability of making an error conditional upon this event is, of course, given by the area underneath the pdf curve of $x(T_b)$ corresponding to logic 0 ($-A$), but extending from 0 to $\infty$. This is indicated by the shaded part in Figure 1.16(c). Therefore, the total probability of making an error when transmitting logic 0, denoted by $P_{e0}$, is given by

Fig. 1.16  Illustration of PDF with additive noise


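$$P_{e0} = \frac{1}{2}\int_0^{\infty} \frac{1}{\sqrt{2\pi}\,\sigma_n}\exp\!\left[-\frac{(x+A)^2}{2\sigma_n^2}\right]dx \qquad (1.5)$$

Here $\sigma_n$ is the standard deviation of the pdf for $n(t)$. Now from symmetry $P_{e1} = P_{e0}$. Hence the total error, $P_e$, is $2P_{e0}$.

Substituting this in (1.5) and with the proper change of variable, $y = (x+A)/\sigma_n$, we have

$$P_e = \int_{A/\sigma_n}^{\infty} \frac{1}{\sqrt{2\pi}}\,e^{-y^2/2}\,dy \qquad (1.6)$$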


Notice that the integral is the familiar Gaussian tail integral, closely related to the complementary error function and usually denoted as Q (not to be confused with quality factor Q). In shorthand form we have


$$P_e = Q(A/\sigma_n) \qquad (1.7)$$
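
As a numerical aid, the Q function can be evaluated through the complementary error function using the standard identity $Q(x) = \tfrac{1}{2}\,\mathrm{erfc}(x/\sqrt{2})$. The sketch below uses Python's `math.erfc`; the function names and the sample amplitudes are assumptions for illustration.

```python
import math

def q_function(x):
    """Gaussian tail probability Q(x) = 0.5 * erfc(x / sqrt(2))."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def error_probability(A, sigma_n):
    """Evaluate (1.7): Pe = Q(A / sigma_n)."""
    return q_function(A / sigma_n)

if __name__ == "__main__":
    for A in (1.0, 2.0, 3.0, 4.0):
        print(A, error_probability(A, sigma_n=1.0))
```

The rapid fall of the printed values as $A/\sigma_n$ grows reflects how sensitive the error probability is to the ratio of signal amplitude to noise standard deviation.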


The advantage of this expression is that we can see that $P_e$ is dependent on $A/\sigma_n$, which, as will be shown, is related to the SNR. How do we derive the expression of $P_e$ in terms of SNR?

First let us define the energy of the detected signal to be $E_d$. Then, referring to Figure 1.15 again, $E_d = \int_{-\infty}^{\infty} |p_0(t) - p_1(t)|^2\,dt$. Now the noise $n(t)$ in the channel has a power of $N_0/2$. Taking the ratio, we have (energy of detected signal)/noise $= 2E_d/N_0$. But from the SNR definition, since the pulse has amplitude $A - (-A) = 2A$, the ratio (energy of detected signal)/noise can also be written as $(2A)^2/\sigma_n^2$. Equating these two expressions for the ratio, we have $(2A)^2/\sigma_n^2 = 2E_d/N_0$. We can then express $A/\sigma_n$ as $\sqrt{E_d/(2N_0)}$. Substituting into (1.7), we have [2]

$$P_e = Q\!\left(\sqrt{E_d/(2N_0)}\right) \qquad (1.8)$$

Let us apply (1.8) to various modulation schemes and let us start with BFSK. What are $p_0$ and $p_1$ for BFSK? Since we assume coherent detection, $p_0$ and $p_1$ are orthogonal, and so we have $E_d = \int_{-\infty}^{\infty} \left[p_0^2(t) + p_1^2(t)\right]dt$. If the $p(t)$ are sinusoids, then $E_d$ becomes $E_d = A^2 T_b$. Here $A$ is the amplitude of the sinusoid and $T_b$ is the bit period. Let us now define a new term, called the average energy per bit, and denote it as $E_b$. Again, if the $p(t)$ are sinusoids, then $E_b = A^2 T_b/2$. Hence $E_d = 2E_b$. Substituting into (1.8), we have

$$P_e = Q\!\left(\sqrt{E_b/N_0}\right) \qquad (1.9)$$

Next we turn to BPSK, where we have

$$E_d = \int_{-\infty}^{\infty} |p_0(t) - p_1(t)|^2\,dt = \int_0^{T_b} (2A\cos\omega_0 t)^2\,dt = 2A^2 T_b$$

Again $E_b = A^2 T_b/2$, and therefore this time we have $E_d = 4E_b$. Substituting into (1.8), we have

$$P_e = Q\!\left(\sqrt{2E_b/N_0}\right) \qquad (1.10)$$


We  can  go  through  similar  derivations  for  QPSK  and  MSK. 
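
The two results (1.9) and (1.10) can be compared numerically. The sketch below evaluates both for an $E_b/N_0$ given in dB; the function names are hypothetical and the Q function is computed from `math.erfc`.

```python
import math

def q_function(x):
    """Gaussian tail probability Q(x) = 0.5 * erfc(x / sqrt(2))."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def ber_bfsk(ebn0_db):
    """Coherent BFSK, equation (1.9): Pe = Q(sqrt(Eb/N0))."""
    ebn0 = 10.0 ** (ebn0_db / 10.0)
    return q_function(math.sqrt(ebn0))

def ber_bpsk(ebn0_db):
    """Coherent BPSK, equation (1.10): Pe = Q(sqrt(2 Eb/N0))."""
    ebn0 = 10.0 ** (ebn0_db / 10.0)
    return q_function(math.sqrt(2.0 * ebn0))

if __name__ == "__main__":
    for db in (0, 3, 6, 9, 12):
        print(db, ber_bfsk(db), ber_bpsk(db))
```

The factor of 2 inside the square root means that BFSK needs $10\log_{10}2 \approx 3$ dB more $E_b/N_0$ than BPSK to reach the same error probability.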

Now that we have found expressions for $P_e$ in terms of $E_b/N_0$, we want to relate $E_b/N_0$ to SNR and hence express $P_e$ in terms of SNR [3]. The SNR can be related to $E_b/N_0$ by the following relationship:

$$E_b/N_0 = \mathrm{SNR} \times (f_N/R_b) \qquad (1.11)$$

where $f_N$ is the effective noise bandwidth and $R_b$ is the effective symbol rate (or gross data rate).

Fig. 1.17  BER as a function of SNR

We can apply (1.11) to (1.9) and (1.10) and obtain $P_e$ expressions for BFSK and BPSK. For example, for the BPSK case, (1.10) becomes

$$P_e = Q\!\left(\sqrt{2 \times \mathrm{SNR} \times (f_N/R_b)}\right) \qquad (1.12)$$


We  can  make  one  more  comment:  Pe  is  the  probability  of  making  an  error  when 
we  transmit  a  symbol,  which  gives  us  the  BER.  Hence  (1.12)  can  be  interpreted  as 
follows:  for  a  given  BER,  (1.12)  tells  us  the  required  SNR  needed  at  the  input  of  the 
demodulator  to  achieve  this  BER.  Plots  of  BER  versus  SNR  are  normally  derived 
for  various  modulation  schemes  and  represent  key  design  information.  As  an 
illustration  we  have  given  such  plots  for  BPSK  and  GMSK  in  Figure  1.17. 
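
Equation (1.12) can also be inverted numerically to find the SNR required for a target BER. The sketch below does this for BPSK only, using bisection on the Q function; it does not reproduce the GMSK curve of Figure 1.17. The function names and the default ratio $f_N/R_b = 1$ are assumptions made for illustration.

```python
import math

def q_function(x):
    """Gaussian tail probability Q(x) = 0.5 * erfc(x / sqrt(2))."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def q_inverse(p, lo=0.0, hi=10.0, iterations=100):
    """Find x with Q(x) = p by bisection (valid for Q(hi) <= p <= 0.5)."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if q_function(mid) > p:
            lo = mid        # Q is decreasing: need a larger argument
        else:
            hi = mid
    return 0.5 * (lo + hi)

def required_snr_db_bpsk(target_ber, fn_over_rb=1.0):
    """SNR (dB) at the demodulator input that gives target_ber for BPSK."""
    x = q_inverse(target_ber)
    eb_n0 = x * x / 2.0            # from (1.10): Pe = Q(sqrt(2 Eb/N0))
    snr = eb_n0 / fn_over_rb       # from (1.11): Eb/N0 = SNR * (fN/Rb)
    return 10.0 * math.log10(snr)

if __name__ == "__main__":
    print(required_snr_db_bpsk(1e-3))
```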


1.6.1.1     Numerical  Example  1.1 

From Table 1.1 it can be seen that for the DECT standard a BER of $10^{-3}$ is needed and that it uses the modulation scheme GMSK. Applying this to Figure 1.17, we conclude that the SNR needed at the input of the demodulator is 9 dB, or

$$\mathrm{SNR} = 9\ \mathrm{dB} \qquad (1.13)$$


This  number  will  be  used  later  in  the  receiver  design. 

The above derivation assumes that the channel impairment only comes from AWGN and that the received power is constant. For the case when the pulse is a sine wave, A, the amplitude of the received sine wave, is constant. In reality, this is not the case. Wireless environments suffer from path loss and multipath fading, which make A substantially smaller during a fraction of the signaling interval. This will, in general, require a much larger SNR to achieve the same BER. In subsection 1.9.2 we will rederive this SNR.




Fig.  1.18    Baseband  PAM  signal  was  modulated  on  a  high-frequency  carrier  signal  with  unity 
amplitude 


1.6.2    Finite  Channel  Bandwidth 


In Table 1.1, we see that DECT uses GMSK. What is GMSK? GMSK is just MSK with a particular pulse shaping. In this subsection we discuss how finite bandwidth in the channel introduces ISI and how pulse-shaping techniques combat ISI. Referring back to Figure 1.1, let us imagine there are only the source/modulator/channel/demodulator blocks. The baseband signal from the source is modulated on a carrier frequency for transmission. The baseband signal is called a baseband pulse amplitude modulation (PAM) signal. Thus, the baseband signal forms some sort of envelope over the high-frequency carrier signal, as shown in Figure 1.18.

If  the  pulse  that  the  modulator  sends  is  a  rectangular  pulse,  then  the  envelope  is 
rectangular.  If  the  channel  is  ideal,  there  is  no  problem.  However,  if  the  channel  is 
bandlimited,  the  pulses  will  spread  in  time.  The  pulse  of  each  symbol  will  smear 
into  the  time  intervals  of  succeeding  symbols,  causing  ISI  and  thus  degrading  the 
BER.  In  addition,  such  smearing  causes  out-of-band  radiation  in  the  adjacent 
channel.  Even  though  both  of  these  can  be  improved  by  increasing  the  channel 
bandwidth,  it  is  costly  to  do  so.  Alternatively,  we  can  manipulate  the  RF  spectrum 
of  the  transmitted  symbol.  Usually,  this  is  difficult  to  do,  and  spectrum  manipulation 
is  done  at  the  baseband  instead.  Such  manipulation  is  called  pulse  shaping  and  is 
done  inside  the  modulator  block  by  the  transmit  filter,  whereby  the  rectangular  pulse 
is  shaped  in  a  particular  way  before  being  transmitted.  Similarly,  on  the  receiver 
side,  inside  the  demodulator  block,  there  is  a  filter,  called  the  receive  filter,  that  is 
specifically  designed  to  receive  this  particular  pulse  shape.  Henceforth,  to  simplify 
explanation,  we  will  eliminate  the  carrier  in  our  discussion  and  talk  as  if  the  channel 
works  directly  on  the  baseband  pulses. 
Fig. 1.19  A sinc pulse in the time domain (a) and its frequency response (b). The interval $T$ is the same as $T_b$ and will be taken as $T = 1$ ms in the text discussion for illustration purposes

We observe that, in general, the combined effects of the transmit and receive filters, and the propagation media, determine the pulse $p(t)$. Let us consider a pulse that has a maximum value of unity at some time $t_0$ and has zero value at all time instants $t_0 + kT$, where $k$ is a nonzero integer. If we apply this pulse to the modulator in Figure 1.1, then $x_R(kT_b)$, the output of the demodulator at the $k$th symbol instant in the receiver, can be written as (for a noiseless channel)


$$x_R(kT_b) = \sum_{m=-\infty}^{+\infty} A_m\,p(kT_b - mT_b) \qquad (1.14)$$

$$\phantom{x_R(kT_b)} = A_k + \sum_{\substack{m=-\infty \\ m \neq k}}^{+\infty} A_m\,p(kT_b - mT_b) \qquad (1.15)$$

Here $A_m$ is a coded sequence of symbols (for example, a binary sequence with two levels 0 and 1), and $p(t)$ represents the pulse of this sequence. Notice that in (1.15) $A_k$ is the $k$th transmitted symbol (the $m = k$ term, weighted by the unity pulse peak), which is the desired received output. The second term contains all the undesirable interference and represents what we call ISI.

It can be shown that if $p(t)$ in (1.14) is a sinc pulse, as shown in Figure 1.19, neighbouring symbols will not interfere with one another, provided that we make the decision at $t = kT_b$. The sinc pulse satisfies the Nyquist criterion, which requires a pulse to have regular zero crossings spaced at multiples of the signaling interval so that at the receiver the transmitted data symbols can be detected without any mutual interference.
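
The zero-ISI property of the sinc pulse can be checked by evaluating the sum in (1.14) directly for a short, finite symbol sequence. In the sketch below the function names, the bipolar symbol values, and the optional `timing_error` argument are assumptions introduced for illustration.

```python
import math

def sinc(x):
    """Normalized sinc: sin(pi x) / (pi x), with sinc(0) = 1."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)

def received_sample(symbols, k, T=1e-3, timing_error=0.0):
    """Evaluate the sum in (1.14) at t = k*T + timing_error with p(t) = sinc(t/T)."""
    t = k * T + timing_error
    return sum(a * sinc((t - m * T) / T) for m, a in enumerate(symbols))

if __name__ == "__main__":
    symbols = [1, -1, 1, 1, -1]
    # Sampling exactly at the symbol instant returns the symbol itself.
    print(received_sample(symbols, 2))
    # Sampling 0.1 T late lets the neighbouring pulses leak in.
    print(received_sample(symbols, 2, timing_error=1e-4))
```

Sampling exactly at $t = kT$ recovers the $k$th symbol, while a sampling offset lets the slowly decaying tails of neighbouring pulses contribute, which is the timing sensitivity discussed below.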

To explain more clearly how adopting a sinc pulse can eliminate ISI, let us go through the following example. We assume that we have a bipolar format where for the $k$th symbol a logic 1 is represented by +1 V at time instances $t = kT_b$ and a logic 0 is represented by -1 V at time instances $t = kT_b$. For ease of illustration we look at a transmission sequence that consists of only 2 bits: $b_0$ at -1 ms and $b_1$ at 1 ms, where at -1 ms a logic 0 and at +1 ms a logic 1 are transmitted, as shown in Figure 1.20. Let us first take a look at bit $b_1$, whose decision is made at $t = 1$ ms. What does bit $b_0$ do to $b_1$? Notice that after -1 ms bit $b_0$ still has some residual energy, although it is decaying. However, because the pulse is a sinc pulse, it crosses zero at 1 ms, which is exactly the time when we determine the value of $b_1$. Hence the energy of $b_0$ does not interfere with the energy of $b_1$ at the decision time of $b_1$, and there is no ISI.

Fig. 1.20  A baseband PAM signal represented by a sinc pulse, where there is a logic 0 at -1 ms and a logic 1 at 1 ms

Having shown that ideally a sinc pulse representation does not introduce ISI, we should be aware that there are two important practical issues to keep in mind:

1.  A sinc pulse introduces additional error due to timing jitter of the sampling clock. The eye diagram illustrates the degradation (Figure 1.21). The eye diagram is easily generated in practice using an oscilloscope, where the symbol timing $T_b$ serves as the trigger. An eye diagram consists of many overlaid traces of small sections of the received signal. Signal thresholds located at the vertical midpoints of the eyes dictate the data value decided for each sample. With ISI, the eye opening will close vertically. Even when the vertical closure is incomplete, the ISI reduces the immunity against other nonideal effects, such as AWGN, and the receiver may fail to detect the actually transmitted sequence. Hence the wider the vertical eye opening, the greater the noise immunity. With AWGN present, closing of the eye will degrade the achievable BER for a given SNR. Hence the impact of ISI can be appreciated by its effect on the BER.

2.  With a sinc pulse representation, we cannot realize in practice a filter with such a sharp transition.

Fig. 1.21  One possible illustration of an eye diagram, where a unipolar format (the logic 0 is
represented  by  a  zero  pulse)  has  been  used  to  represent  the  baseband  sequence.  The  vertical  eye 
opening  (a)  indicates  the  immunity  to  all  possible  intersymbol  interference  phenomena.  The 
horizontal  eye  opening  (b)  indicates  the  immunity  to  timing  phase,  and  the  slope  (c)  indicates 
the  sensitivity  to  the  jitter  in  the  timing  phase 


Because of these two limitations, there are other pulse shapes that people adopt. One is called a raised-cosine pulse shape, which also satisfies the Nyquist criterion. It is given as

$$p(t) = \frac{\mathrm{sinc}(t/T_b)\,\cos(\pi\alpha t/T_b)}{1 - (2\alpha t/T_b)^2} \qquad (1.16)$$

where $0 < \alpha < 1$ and $\alpha$ is the rolloff factor. This pulse has a maximum at $t = 0$ and is zero at all $t = kT_b$ for nonzero integer $k$, as desired. We observe that the raised-cosine pulse is strictly band limited, but it exceeds the ideal minimal bandwidth (Nyquist bandwidth $= 1/T_b$) by a certain amount, called the excess bandwidth.
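
The zero crossings of (1.16) can be confirmed numerically. The sketch below is illustrative: the function names and the rolloff values are assumptions, and the point $t = \pm T_b/(2\alpha)$, where the expression becomes 0/0, is handled with its limiting value $(\pi/4)\,\mathrm{sinc}(1/(2\alpha))$.

```python
import math

def sinc(x):
    """Normalized sinc: sin(pi x) / (pi x), with sinc(0) = 1."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)

def raised_cosine(t, Tb=1.0, alpha=0.5):
    """Raised-cosine pulse of equation (1.16)."""
    x = t / Tb
    denom = 1.0 - (2.0 * alpha * x) ** 2
    if abs(denom) < 1e-12:
        # Limit at t = +/- Tb / (2 alpha), where numerator and denominator vanish.
        return (math.pi / 4.0) * sinc(1.0 / (2.0 * alpha))
    return sinc(x) * math.cos(math.pi * alpha * x) / denom

if __name__ == "__main__":
    for alpha in (0.35, 0.5):
        samples = [raised_cosine(k, alpha=alpha) for k in range(0, 5)]
        print(alpha, samples)
```

For both rolloff values the pulse equals 1 at $t = 0$ and vanishes (to within rounding) at the other integer multiples of $T_b$.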

Another pulse that has been adopted is the one generated by a Gaussian pulse-shaping filter. This pulse does not satisfy the Nyquist criterion. Instead of generating zero crossings at adjacent symbol peaks, the Gaussian filter possesses a smooth transfer function but generates no zero crossing. The pulse is given by

$$p(t) = \frac{\sqrt{\pi}}{\alpha}\exp\!\left(-\frac{\pi^2 t^2}{\alpha^2}\right) \qquad (1.17)$$

Here $\alpha = 0.5885/B$, where $B$ is the -3 dB bandwidth of the Gaussian filter. Hence one can see that as $\alpha$ increases, the pulse occupies less bandwidth, but this leads to more time dispersion and hence more ISI. Consequently, we are trading ISI performance for spectral efficiency. When this pulse is applied to an MSK modulation scheme, the resulting scheme is called GMSK.
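
The trade-off between bandwidth and time dispersion can be made concrete by evaluating the Gaussian pulse one bit period away from its peak. The sketch below assumes the form $p(t) = (\sqrt{\pi}/\alpha)\exp(-\pi^2 t^2/\alpha^2)$ with $\alpha = 0.5885/B$; the function names and the two bandwidth-time products are chosen purely for illustration.

```python
import math

def gaussian_pulse(t, B):
    """Gaussian pulse with alpha = 0.5885 / B, B being the -3 dB bandwidth."""
    alpha = 0.5885 / B
    return (math.sqrt(math.pi) / alpha) * math.exp(-(math.pi * t / alpha) ** 2)

def tail_ratio(bt_product, Tb=1.0):
    """Pulse value one bit period from the peak, relative to the peak value."""
    B = bt_product / Tb
    return gaussian_pulse(Tb, B) / gaussian_pulse(0.0, B)

if __name__ == "__main__":
    for bt in (0.3, 0.5):
        print(bt, tail_ratio(bt))
```

A smaller bandwidth (larger $\alpha$) leaves a larger residue at the neighbouring symbol instant, and the pulse never crosses zero, so some ISI is always present.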


1.7     Wireless  Channel  Description 

So  far  we  have  discussed  only  two  nonidealities  of  the  channel:  AWGN  and  finite 
bandwidth.  However,  there  are  other  major  problems  encountered  in  a  wireless 
channel:  path  loss  and  multipath  fading.  In  this  section,  we  discuss  how  to  add  these 
nonidealities  to  the  channel.  The  path  loss  of  the  channel  severely  attenuates  the 
transmitted  signal  and  sets  a  lower  bound  on  the  signal  strength  the  receiver  can 
expect.  Multipath  fading  does  a  few  things.  First,  as  with  path  loss  it  attenuates 
the  transmitted  signal.  Attenuation  introduced  by  multipath  fading  further  adds 
to  the  attenuation  introduced  by  path  loss.  Together  with  AWGN  originally  present 
in  the  channel,  this  attenuation  in  the  received  signal  strength  sets  a  limit  on  the 
SNR  required  of  the  demodulator  to  achieve  a  certain  BER.  The  second  impairment 
brought  about  by  multipath  fading,  distortion,  introduces  ISI,  which  limits  the 
achievable  BER.  Finally,  Doppler  shift  introduces  phase  impairment  to  the 
modulated  signal  received  under  multipath  condition  and  is  another  error  source 
that  can  limit  the  achievable  BER.  To  investigate  these  problems  properly,  we  need 
to  develop  more  complex  channel  models,  namely,  channels  having  randomly  time- 
varying  impulse  responses. 

We now discuss the channel model for wireless communication and its impact on receiver front end design. We start by considering first the path loss and multipath fading happening in a channel, two phenomena that are closely linked. As shown in Figure 1.22, the variation of received power with distance from the transmitter can be understood by observing its average value at a given distance from the transmitter as well as its local variation in close spatial proximity to that given location [4]. The first is characterized by path loss and the second by multipath fading. As such, path loss describes a large-scale propagation phenomenon and multipath fading describes a small-scale propagation phenomenon.

What  leads  to  such  received  power  variation?  Physically,  between  the  transmitter 
and  the  receiver  there  are  many  propagation  paths,  and  signals  traveling  through  these 
different  paths  interfere  with  one  another.  To  illustrate  this,  we  draw  a  simple  picture 
that  incorporates  four  of  these  paths  (Figure  1.23). 

Fig. 1.22  Received power under path loss and multipath fading

Fig. 1.23  Four paths are shown to differentiate the effect of interference caused by global and local reflectors (the figure shows the Tx antenna, the Rx antenna, and the ground acting as a global reflector)


1.7.1     Path  Environment 


Referring  to  Figure  1.23,  we  would  like  to  classify  the  paths  according  to  the 
reflection  they  may  undergo.  The  first  classification  is  according  to  whether  or 
not  they  suffer  from  global  (large-scale)  reflection.  Paths  1  and  2  do  not  go  through 
global  reflections  whereas  paths  3  and  4  do  go  through  such  reflections.  The  second 
classification  is  according  to  whether  or  not  they  suffer  from  local  (small-scale) 
reflection.  Paths  1  and  3  do  not  go  through  local  reflections  whereas  paths  2  and  4 
do  go  through  such  reflections.  Global  and  local  reflections  result  in  different 
interference  patterns. 

Fig. 1.24    Two paths are drawn to show the effect of interference caused by global reflectors
(labels: Tx antenna, Rx antenna)

1.7.1.1     Global  Reflection 

In this subsection we focus on large-scale reflection. Let us follow the signals going
through path 1 and path 3, which are redrawn in Figure 1.24. The signal going
through path 1 (with path length between the Tx and Rx antennas equal to d') arrives at
the Rx antenna directly. A replica of this signal follows path 3, where it bounces off a
global reflector (e.g. a hill) before it arrives at the Rx antenna (with path length between
the Tx and Rx antennas equal to d''). There it interferes with the signal that goes through
path 1. At the Rx antenna these two signals have a phase
difference proportional to d'-d''. Specifically, d'-d'' is very small compared with λ
[4], the wavelength of the carrier. This is true even when the Rx antenna moves. Thus,
the interference is always destructive in nature (d'' is always larger than d', but the
difference is never large enough to cause a 180° phase shift; the 180° phase shift that
makes the interference destructive is due to the reflection itself). In addition, because
the phase difference is small, movement of the Rx antenna results in a rather small
variation of received power. Since the reflection is due to global objects (like a hill),
which rarely move or change, the interference is
simple in nature. In summary, the interference patterns in the present case are formed
as a result of reflection due to global objects and are deterministic in nature.


1.7.1.2     Local  Reflection 


In this subsection we focus on small-scale reflection. Let us follow the signals going
through path 1 and path 2 in Figure 1.23, which are redrawn in Figure 1.25. The signal
goes along path 1 from the Tx antenna to the Rx antenna. A replica of this signal goes along
path 2. Near the Rx antenna it bounces off a local reflector (e.g. a wall right next to
the phone) before it arrives at the Rx antenna. There it interferes with the signal that
travels along path 1. Depending on where the Rx antenna is situated with respect to the
Tx antenna, this interference can be destructive or constructive, resulting in variation
of received power. In addition, since the phase difference can be large, this
variation of received power can be large. Since the reflection is due to local
objects (like a door, which can change, or another person, who can move),
which often move or change, the interference is complicated in nature. In summary,
the interference patterns in the present case
are formed as a result of reflection due to local objects and are random (can be
constructive or destructive) in nature.

Fig. 1.25    Two paths are drawn to show the effect of interference caused by a local reflector
(path 1: direct, distance d'; path 2: reflected off a local reflector, distance d''; the ground
acts as the global reflector)

Because  of  differences  in  interference  patterns  in  these  two  cases,  different  paths 
will  be  used  in  describing  their  different  channel  behaviour. 
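
The contrast between the two cases can be illustrated numerically. The sketch below is a toy two-ray model, not something taken from the text: it adds a direct ray and one reflected ray of assumed relative amplitude `rho`, and reports the received power relative to the direct ray alone. The function name, the value of `rho`, and the 1.9 GHz carrier are illustrative assumptions.

```python
import cmath
import math

def two_ray_power_db(excess_path_m, wavelength_m, rho=0.8, invert=False):
    """Power of direct + reflected ray, in dB relative to the direct ray alone.

    excess_path_m: extra distance travelled by the reflected ray.
    rho: assumed amplitude of the reflected ray relative to the direct ray.
    invert: add the 180 degree phase reversal attributed to the reflection.
    """
    phase = -2.0 * math.pi * excess_path_m / wavelength_m
    if invert:
        phase += math.pi
    total = 1.0 + rho * cmath.exp(1j * phase)
    return 20.0 * math.log10(abs(total))

wavelength = 3.0e8 / 1.9e9  # roughly 0.16 m for an assumed 1.9 GHz carrier

# Local reflector: the excess path sweeps through a whole wavelength as the
# handset moves, so the result swings between constructive and destructive.
for frac in (0.0, 0.25, 0.5, 0.75, 1.0):
    p = two_ray_power_db(frac * wavelength, wavelength)
    print(f"local,  excess path = {frac:4.2f} wavelengths: {p:6.1f} dB")

# Global reflector: the excess path stays a small fraction of a wavelength and
# the reflection inverts the phase, so the result is always destructive.
for frac in (0.01, 0.02, 0.05):
    p = two_ray_power_db(frac * wavelength, wavelength, invert=True)
    print(f"global, excess path = {frac:4.2f} wavelengths: {p:6.1f} dB")
```

In the local case the printed power moves by many decibels over a fraction of a wavelength, whereas in the global case it stays below 0 dB and changes only slowly, which is the qualitative distinction drawn above.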

1.7.2    Path  Loss:  A  First  Glance 

We start by describing path loss. We are interested in large-scale propagation,
usually on the order of 5λ to 50λ. Hence this path loss is described by interference
effects between signals propagating through paths 1 and 2 on the one hand and paths 3
and 4 on the other.
To describe this effect Figure 1.23 has been redrawn in Figure 1.24. As discussed
before, only path 1 (also denoted as the line-of-sight [LOS] path) and path 3 (also
denoted as the non-line-of-sight [NLOS] path) from Figure 1.23 are shown. Interference
between paths 1 and 4, paths 2 and 3, and paths 2 and 4 is assumed to show similar
characteristics. Characteristics of the path loss phenomenon include the following:

1.  Because it involves large-scale propagation, we are interested not in the
instantaneous power, but in the local averaged power. Since we
average such power over a region of 5λ to 50λ, we can assume
a constant power over this region, with its value set equal to the average.

2.  Because path loss is attributed to interference between paths 1 and 2 and paths 3 and 4,
we can conclude that the received power goes down as transmitter/receiver
separation increases. This is as indicated in Figure 1.22. Also, the effect of
movement (both due to the mobile and the reflectors) is averaged out. The effect
is a simple loss in signal strength received by the receiver. Moreover, the signal
loss to first order is a simple function of distance, and not of angle of incidence
and other factors. Of course, more sophisticated models include path loss due not
only to free space propagation and reflection, but also to diffraction and scattering.

3.  Because it involves large-scale propagation, the number of relevant paths is
small, usually boiling down to one or two. Hence the example presented in Figure
1.24 turns out to be quite representative of the real-life situation.

Path loss attenuates the received signal, which lowers the SNR
and hence raises the BER.

1.7.3    Multipath Fading: A First Glance

In this subsection we describe multipath fading. We are interested in small-scale
propagation. Hence this fading is described by interference effects
between signals propagating through paths 1 and 2 or through paths 3 and 4.
To describe this effect, Figure 1.23 has been redrawn in Figure 1.25, where for
simplicity we just show paths 1 and 2. It is assumed that the interference effects
between paths 3 and 4 will show similar characteristics.

Characteristics of the multipath phenomenon include the following:

1.  Since we are interested in the instantaneous power available to the receiver, we are
interested in the instantaneous power due to the sum of path 1 and path 2.

2.  Because signal loss is attributed to interference between path 1 and path 2, we
can conclude that the instantaneous received power goes up and down with large
variation as transmitter/receiver separation increases. This is as indicated in
Figure 1.22.

3.  Because of constructive and destructive interference of the carrier arriving over
multiple paths, we can have the following:

a.  Fluctuation of the envelope strength of the resultant carrier. This change
in envelope strength can occur rapidly as a function of separation and is
called envelope fading.

b.  Distortion of the shape of the envelope of the resultant carrier, which
leads to ISI of the passband signal. The distortion of this envelope depends on
the relative delay of the various reflected signals. Analyzing these
delays works like analyzing a filter, and hence the radio channel can be
characterized as having a frequency response. Depending on whether the
frequency response of the radio channel is constant or not across the narrow
band the signal occupies, we have flat or frequency-selective fading. If we have flat
fading, there is no ISI, and if we have frequency-selective fading, there is ISI.

c.  A time-varying phase caused by movement (of the receiver itself or of a local
reflector). This gives rise to another effect, called Doppler
shift. Depending on whether the envelope of the resultant carrier changes slowly or
rapidly relative to the symbol time, the channel is said to exhibit slow fading or
fast fading.

4.  There is a large number of reflected radio waves (much larger than 2) whose
amplitudes are random and time varying in nature, both due to the random nature
of the bit sequences and the movement of the reflectors/scatterers. Hence to
describe and examine the phenomenon, a full-blown time-varying statistical
model for the channel must be invoked.

Multipath fading lowers the SNR and induces ISI, both of which raise the BER.
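
The filter view of the channel mentioned in item 3b can be made concrete with a two-path example. The sketch below is an illustration under assumed values (one echo of relative amplitude 0.9, a 1.9 GHz carrier, a 1 MHz signal band); the function names are hypothetical. It evaluates the magnitude of H(f) = 1 + a·exp(-j2πf·delay) across the signal band and reports how much it varies.

```python
import cmath
import math

def two_path_response(f_hz, delay_s, a=0.9):
    """|H(f)| for a direct path plus one echo of relative amplitude a."""
    return abs(1.0 + a * cmath.exp(-2j * math.pi * f_hz * delay_s))

def ripple_db(fc_hz, bw_hz, delay_s, points=201):
    """Peak-to-peak variation of |H(f)| in dB across the signal band."""
    mags = [two_path_response(fc_hz + bw_hz * (k / (points - 1) - 0.5), delay_s)
            for k in range(points)]
    return 20.0 * math.log10(max(mags) / min(mags))

fc, bw = 1.9e9, 1.0e6  # assumed carrier frequency and signal bandwidth

# A short echo (10 ns) versus an echo whose delay is comparable to 1/bw (1 us).
for delay in (10e-9, 1e-6):
    print(f"echo delay {delay * 1e9:7.1f} ns: ripple across band = "
          f"{ripple_db(fc, bw, delay):6.3f} dB")
```

With the short echo the response is essentially constant over the band (flat fading, no ISI); with the long echo the response varies strongly within the band (frequency-selective fading, ISI).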

1.8     Path Loss: Detailed Discussion

1.8.1     Friis Equation

As  discussed  in  subsection  1.7.2  most  wireless  channels  have  both  a  LOS  and  a 
NLOS  propagation  path.  To  get  a  feeling  for  the  path  loss  in  such  an  environment, 
we  will  first  calculate  the  path  loss  along  the  LOS  path.  We  discuss  the  free  space 
propagation  loss  in  such  an  environment,  where  no  obstacles  occur  between  the 
transmit  and  the  receive  antenna. 

To  get  a  useful  estimation  for  the  path  loss,  we  will  make  use  of  fundamental 
antenna  theory.  Let  us  consider  first  an  isotropic  antenna;  namely,  one  that  radiates 
power  equally  in  all  directions.  Let  us  denote  as  PT  the  power  transmitted  by  the 
transmit  antenna  and  AR  the  capture  area  of  the  receive  antenna  spaced  r  meters 
apart.  Since  the  transmit  antenna  is  assumed  to  be  isotropic,  the  received  power 
captured  by  the  area  AR  is: 

$$P_R = P_T \frac{A_R}{4\pi r^2} \tag{1.18}$$

Obviously,  in  practice  we  would  design  the  transmit  antenna  to  focus  its  radiated 
energy  in  the  direction  of  the  receiving  antenna.  Furthermore,  we  have  to  consider 
that  the  receive  antenna  does  not  actually  capture  all  the  electromagnetic  radiation 
incident  on  it.  Thus,  we  have  to  introduce  some  correction  factors  in 
equation  (1.18): 

$$P_R = P_T \frac{A_R}{4\pi r^2}\, G_T\, \eta_R \tag{1.19}$$

where GT is the transmit antenna gain (to account for the focusing) and η_R is the
receive antenna efficiency (to account for incomplete capturing).

At  microwave  frequencies,  aperture  antennas  (i.e.  parabolic)  are  typically  used 
for  receiving,  and  for  these  antennas  the  achievable  antenna  gain  is  defined  as 

$$G = \frac{4\pi A}{\lambda^2}\,\eta \tag{1.20}$$

where A is the antenna capture area, λ is the wavelength of the transmission, and η
is the antenna efficiency. We observe that for this antenna type the antenna gain is a
function of the antenna dimension relative to the carrier wavelength. Accordingly, the
higher the carrier frequency, the smaller the antenna
dimensions required for a given antenna gain. For the receive antenna, let us go to (1.20) and
set G = GR, A = AR, η = η_R. We can then solve for AR in terms of GR. Next we substitute this
expression for AR in (1.19) and we get a relation known as the Friis equation:

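$$\frac{P_R}{P_T} = G_T G_R \left(\frac{\lambda}{4\pi r}\right)^2 = G_T G_R \left(\frac{c}{4\pi f_c r}\right)^2 \tag{1.21}$$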

Here PR is the power received by the antenna with an antenna gain GR, PT is
the transmitted power with an antenna gain GT, fc is the carrier frequency, λ is the
corresponding wavelength, c is the speed of light, and r is the spacing between
the transmit and receive antennas. The propagation loss LB, expressed in decibels, is
now given by expressing the Friis equation in decibels:

$$L_B[\mathrm{dB}] = 10\log_{10}\frac{P_R}{P_T} = 10\log_{10}G_T + 10\log_{10}G_R - 20\log_{10}f_c - 20\log_{10}r + 147.56\,\mathrm{dB} \tag{1.22}$$

In  wireless  communications  we  usually  deal  with  carrier  frequencies  in  the  range 
of  MHz  to  GHz.  Equation  (1.22)  can  be  expressed  adopting  units  of  MHz  for  the 
frequency  term.  If  we  further  assume  we  have  omnidirectional  transmit  and  receive 
antennas  with  unity  gain,  (1.22)  becomes 

$$L_B[\mathrm{dB}] = 27.56\,\mathrm{dB} - 20\log_{10}f_c[\mathrm{MHz}] - 20\log_{10}r[\mathrm{m}] \tag{1.23}$$

From  this  basic  LOS  transmission  loss  equation,  we  note  that  the  received  power 
decreases  by  6  dB  for  every  doubling  of  distance  and  also  for  every  doubling  of 
the  radio  frequency.  Consequently,  the  more  we  increase  the  carrier  frequency,  the 
more  power  we  need  to  transmit  the  signal  reliably.  In  addition,  more  repeater 
stations may be necessary. Note that (1.23) is only accurate for a far-field situation.
The  behaviour  of  the  electromagnetic  field  near  the  transmit  antenna  is  difficult  to 
predict  theoretically,  and  its  models  are  mainly  based  on  experimental  data. 
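
The 6 dB figures can be checked directly from the Friis equation. The following is a minimal sketch, assuming unity antenna gains; the function names and the 900 MHz, 100 m operating point are illustrative choices, not values from the text.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def friis_gain_db(fc_hz, r_m, gt=1.0, gr=1.0):
    """10*log10(PR/PT) from the Friis equation (negative, i.e. a loss)."""
    return 10.0 * math.log10(gt * gr * (C / (4.0 * math.pi * fc_hz * r_m)) ** 2)

def friis_gain_db_mhz(fc_mhz, r_m):
    """Unity-gain form with the carrier in MHz and the distance in metres."""
    return 27.56 - 20.0 * math.log10(fc_mhz) - 20.0 * math.log10(r_m)

base = friis_gain_db(900e6, 100.0)
print(f"900 MHz, 100 m:            {base:8.2f} dB")
print(f"change, distance doubled:  {friis_gain_db(900e6, 200.0) - base:8.2f} dB")
print(f"change, frequency doubled: {friis_gain_db(1800e6, 100.0) - base:8.2f} dB")
print(f"MHz form, 900 MHz, 100 m:  {friis_gain_db_mhz(900.0, 100.0):8.2f} dB")
```

Both doublings change the result by 20·log10(2), about 6.02 dB, and the MHz form agrees with the direct evaluation to within the rounding of the 27.56 dB constant.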

Remember  that  so  far  we  have  only  considered  ideal  free  space  propagation  loss 
for  a  LOS  path.  Based  on  empirical  data,  a  fairly  general  model  has  been  developed 
for  NLOS  propagation  paths.  This  model  is  given  as 

$$L_A(d) \propto L_B \cdot \left(\frac{d}{d_0}\right)^{-n} \tag{1.24}$$

Here n is the path loss exponent, which indicates how fast path loss increases
with distance; d0 is the reference distance for free space propagation (unobstructed
transmission distance); LB is the corresponding propagation loss of the LOS path
[refer to (1.22)]; and d is the distance between the transmit and the receive antennas.
The NLOS path loss can therefore be approximated by (1.24) as

$$L_A[\mathrm{dB}] = L_B - 10 \cdot n \cdot \log_{10}\left(\frac{d}{d_0}\right) \tag{1.25}$$

Experimental results indicate that typical NLOS outdoor cellular mobile systems
have a path loss exponent n between 3.5 and 5 and that indoor systems have a path
loss exponent between 2 and 4.

As  discussed  in  subsection  1.7.2,  because  of  reflection,  the  LOS  path  tends  to 
cancel  out  (destructively  interfere  with)  the  NLOS  path,  making  the  loss  larger  than 
in  the  free  space,  and  hence  the  exponent  n  larger  than  2.  Again,  as  mentioned  in 
subsection  1.7.2,  because  we  are  talking  about  large-scale  propagation,  the  path 
difference d'-d'' is small compared with the path distance itself; hence the cancellation
is rather constant, or independent of carrier wavelength. To reiterate, this is
different  from  the  small-scale  propagation  case.  Moreover,  because  of  this  rather 
constant  cancellation,  the  interference  manifests  itself  as  amplitude  loss,  but  not  ISI. 


1.8.2    Amplitude  Loss  and  Minimum  Received  Signal  Strength 

In  general,  (1.23),  (1.24),  and  (1.25)  can  be  used  to  predict  amplitude  loss  and 
minimum  received  signal  strength  for  a  given  wireless  channel  and  a  given 
standard.  Let  us  present  an  example. 


1.8.2.1     Numerical  Example  1.2 

For  a  DECT  receiver,  let  us  first  assume  that  the  transmit  and  the  receive  antennas 
are  separated  by  d  =  50  m  (from  Table  1.1  this  is  the  minimum  cell  radius).  Let  us 
further  assume  that  the  channel  has  a  free  space  loss  with  d0  =  3m  and  that  the  path 
loss  exponent  n  is  3.  This  would  be  a  real-life  situation  for  an  indoor  channel. 
Finally,  we  assume  that  we  use  omnidirectional  antennas  with  unity  gain. 
We  start  off  with  (1.23)  to  find  the  LOS  loss: 

$$L_B(d_0) = 27.56\,\mathrm{dB} - 20\log_{10}(1900\,\mathrm{MHz}) - 20\log_{10}\frac{3\,\mathrm{m}}{3\,\mathrm{m}} = -38\,\mathrm{dB} \tag{1.26}$$

Then for the present environment, we substitute n = 3, d0 = 3 m, and LB obtained
from (1.26) into (1.25) and the amplitude loss is

$$L_A[\mathrm{dB}] = -38\,\mathrm{dB} - 10 \cdot 3 \cdot \log_{10}\left(\frac{50\,\mathrm{m}}{3\,\mathrm{m}}\right) = -74\,\mathrm{dB} \tag{1.27}$$

However,  we  are  really  interested  in  the  maximum  amplitude  loss.  To  calculate 
that,  we  assume  the  transmit  and  the  receive  antennas  are  separated  by  d  =  400  m 
(from  Table  1.1  this  is  the  maximum  cell  radius).  In  this  case  (1.27)  becomes 

$$L_A[\mathrm{dB}] = -38\,\mathrm{dB} - 10 \cdot 3 \cdot \log_{10}\left(\frac{400\,\mathrm{m}}{3\,\mathrm{m}}\right) = -101\,\mathrm{dB} \tag{1.28}$$

From Table 1.1 the maximum transmit power PT was set to 250 mW (=24 dBm).
Thus the received signal strength for the present example, which corresponds to the
minimum received signal strength, is approximately

$$P_{R\min} = 24\,\mathrm{dBm} - 101\,\mathrm{dB} = -77\,\mathrm{dBm} \tag{1.29}$$
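
The chain from (1.25) to (1.29) can be reproduced in a few lines. This is a sketch only: the reference loss LB(d0) = -38 dB is taken as given from (1.26) rather than recomputed, and the function names are hypothetical.

```python
import math

def log_distance_loss_db(lb_db, d_m, d0_m, n):
    """Equation (1.25): LA[dB] = LB - 10*n*log10(d/d0)."""
    return lb_db - 10.0 * n * math.log10(d_m / d0_m)

def mw_to_dbm(p_mw):
    """Convert a power in milliwatts to dBm."""
    return 10.0 * math.log10(p_mw)

LB_DB = -38.0  # reference loss at d0, taken as given from (1.26)
D0_M = 3.0     # reference distance
N_EXP = 3.0    # path loss exponent

la_min_cell = log_distance_loss_db(LB_DB, 50.0, D0_M, N_EXP)   # d = 50 m
la_max_cell = log_distance_loss_db(LB_DB, 400.0, D0_M, N_EXP)  # d = 400 m
pt_dbm = mw_to_dbm(250.0)
pr_min_dbm = pt_dbm + la_max_cell  # the loss is a negative number in dB

print(f"LA at  50 m: {la_min_cell:8.2f} dB")
print(f"LA at 400 m: {la_max_cell:8.2f} dB")
print(f"PT         : {pt_dbm:8.2f} dBm")
print(f"PRmin      : {pr_min_dbm:8.2f} dBm")
```

The unrounded values lie within 1 dB of the rounded figures quoted in (1.27) through (1.29).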


1.8.3    Minimum  Separation 

The  Friis  equation  derived  previously  [(1.21)]  also  allows  us  to  determine  the 
minimum  separation  between  users  for  a  given  interference  level.  Let  us  assume 
this  time  that  we  have  two  users,  users  1  and  2.  User  1  receives  power  from  the  base 
station.  In  addition,  user  1  also  receives  power  transmitted  by  user  2,  which  acts  as 
interference. We would like to estimate this interference's power level. The
interference propagates from user 2 to user 1, following the same path loss mechanism.
From  the  Friis  equation  we  know  that  as  the  separation  between  the  two  users 
decreases,  the  received  interference's  power  level  increases.  Hence  to  calculate  the 
maximum  interference's  power  level  received  by  user  1,  we  first  assume  a  LOS  path 
and  apply  (1.21)  again 

$$\frac{P_{R\max}}{P_T} = G_R G_T \left(\frac{c}{4\pi f_c d_{\min}}\right)^2 \tag{1.30}$$

Here  PRmax  is  the  maximum  allowable  interference  power  level  received  by  user  1, 
PT  is  the  corresponding  power  level  transmitted  by  user  2,  dmin  is  the  minimum 
separation  between  user  1  and  user  2,  and  GR  and  GT  are  the  receive  and  transmit 
gain  of  users  1  and  2,  respectively. 

Thus, solving for dmin in (1.30) gives

$$d_{\min} = \sqrt{\frac{P_T G_R G_T}{P_{R\max}}} \cdot \frac{c}{f_c} \cdot \frac{1}{4\pi} \tag{1.31}$$

The  dmin  calculated  for  an  NLOS  radio  path  is  related  to  the  dmin  calculated  for  a 
LOS  path  in  the  same  way  as  LA(NLOS)  is  related  to  LB(LOS).  Hence  we  apply 
(1.24)  to  (1.31)  and  we  get 


$$d_{\min,\mathrm{NLOS}} = \left[\frac{P_T G_R G_T \left(\lambda / 4\pi d_0\right)^2}{P_{R\max}}\right]^{1/n} \cdot d_0 \tag{1.32}$$

where d0 is the free space distance, λ is the wavelength of the carrier signal and n is
the path loss exponent. Accordingly, (1.32) allows us to determine, for a maximum
interference power level PRmax, the minimum separation that user 2 can be from
user 1, with a NLOS path.
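
A short numerical sketch of (1.31) and (1.32) follows. The operating point (250 mW interferer, a tolerated interference level of -40 dBm, 1.9 GHz carrier, d0 = 3 m, n = 3, unity gains) is an assumed example and not a requirement taken from any standard; the function names are hypothetical.

```python
import math

C = 3.0e8  # speed of light in m/s, rounded

def dmin_los(pt_w, pr_max_w, fc_hz, gt=1.0, gr=1.0):
    """Equation (1.31): minimum separation over a LOS path."""
    return math.sqrt(pt_w * gr * gt / pr_max_w) * (C / fc_hz) / (4.0 * math.pi)

def dmin_nlos(pt_w, pr_max_w, fc_hz, d0_m, n, gt=1.0, gr=1.0):
    """Equation (1.32): minimum separation with path loss exponent n."""
    lam = C / fc_hz
    ratio = pt_w * gr * gt * (lam / (4.0 * math.pi * d0_m)) ** 2 / pr_max_w
    return ratio ** (1.0 / n) * d0_m

PT_W = 0.25                           # assumed interferer power, 250 mW
PR_MAX_W = 1e-3 * 10 ** (-40 / 10)    # assumed tolerable level, -40 dBm
FC_HZ = 1.9e9                         # assumed carrier frequency

print(f"LOS  dmin: {dmin_los(PT_W, PR_MAX_W, FC_HZ):6.2f} m")
print(f"NLOS dmin: {dmin_nlos(PT_W, PR_MAX_W, FC_HZ, d0_m=3.0, n=3.0):6.2f} m")
```

Setting n = 2 in `dmin_nlos` recovers the LOS result of (1.31), which is a convenient consistency check on (1.32).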


1.9     Multipath  Fading:  Channel  Model  and  Envelope  Fading 

In this and the following section we will present a detailed discussion of
multipath fading.


1.9.1     Time-varying  Channel  Model 

In subsection 1.7.3, we stated that to describe multipath fading we need a full-blown
time-varying statistical model for the channel. To describe a time-varying channel
let us start by assuming that the input to the channel is a passband PAM signal.
This passband PAM signal is obtained by modulating a baseband PAM signal on a
high-frequency carrier. The baseband PAM signal is a sequence of pulses, which is
in turn obtained by amplitude modulating the symbols on a given pulse shape.
Alternately,  we  can  view  the  passband  PAM  signal  as  consisting  of  two  sinusoidal 
carriers  at  the  same  frequency  (90°  out  of  phase)  that  are  modulated  by  the  real  and 
imaginary  parts  of  a  complex  valued  baseband  signal.  As  a  side  note,  the  passband 
PAM  scheme  can  be  specialized  to  become  the  PSK,  FSK,  and  QPSK  modulation 
schemes  described  previously.  There  are  different  representations  of  a  passband 
PAM  signal.  One  such  possible  representation  is  the  following: 

$$x_T(t) = \sqrt{2}\,\mathrm{Re}\left[s_l(t) \cdot e^{j2\pi f_c t}\right] \tag{1.33}$$

$$x_T(t) = \sqrt{2}\,\mathrm{Re}\left[\sum_m A_m\, g(t - mT_b) \cdot e^{j2\pi f_c t}\right] \tag{1.34}$$

Here x_T(t) is the transmitted PAM signal, s_l(t) is the baseband PAM signal, fc is
the carrier frequency, A_m is a coded sequence of symbols, g(t) represents the pulse
shape of this sequence, and Tb is the symbol period.
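
As an illustration of this representation, the sketch below evaluates the passband PAM signal at individual time instants. The rectangular pulse, the four QPSK symbols, the 1 µs symbol period, and the deliberately low 4 MHz carrier are assumptions made to keep the numbers readable; none of them come from the text.

```python
import cmath
import math

def rect_pulse(t, tb):
    """Assumed pulse shape g(t): a unit rectangle on [0, tb)."""
    return 1.0 if 0.0 <= t < tb else 0.0

def baseband(t, symbols, tb, g=rect_pulse):
    """Baseband PAM signal: sum over m of A_m * g(t - m*Tb)."""
    return sum(a * g(t - m * tb, tb) for m, a in enumerate(symbols))

def passband(t, symbols, tb, fc, g=rect_pulse):
    """Passband PAM signal: sqrt(2) * Re[baseband(t) * exp(j*2*pi*fc*t)]."""
    rotated = baseband(t, symbols, tb, g) * cmath.exp(2j * math.pi * fc * t)
    return math.sqrt(2.0) * rotated.real

# Four QPSK symbols on the unit circle (illustrative values only).
symbols = [cmath.exp(1j * math.pi * (2 * k + 1) / 4) for k in (0, 1, 3, 2)]
TB = 1.0e-6  # assumed symbol period
FC = 4.0e6   # assumed carrier frequency

for t in (0.0, 0.5e-6, 1.25e-6, 2.5e-6):
    print(f"t = {t * 1e6:4.2f} us: x_T = {passband(t, symbols, TB, FC):+.4f}")
```

Because the symbols are complex, their real and imaginary parts modulate the cosine and sine carriers respectively, which is the two-carrier view described above.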

Next  let  us  apply  this  passband  PAM  signal  to  the  channel.  As  stated  previously, 
since  the  atmosphere  is  inhomogeneous  to  electromagnetic  radiation  due  to  spatial 
variations  in  temperature,  pressure,  humidity,  and  turbulence  or  simply  due  to  the 
fact  that  the  mobile  unit  can  be  in  motion  in  a  wireless  environment  during 
transmission,  the  channel  has  to  be  modeled  to  be  time-variant.  Furthermore,  we 
have  already  noted  that  as  the  most  common  application  of  wireless  communication  is 
in  urban  areas,  there  are  many  local  obstacles  to  radio  transmission  and  many 
opportunities for multiple reflections. As a result of changes occurring in these local
reflectors,  the  resulting  multiple  propagation  paths  have  propagation  delays  and 
attenuation  factors  that  are  time  variant.  Thus,  the  received  signal,  xR(t),  may  be 
expressed  in  the  form 


$$x_R(t) = \sum_i \alpha_i(t)\, x_T\big(t - \tau_i(t)\big) \tag{1.35}$$

where α_i(t) is the attenuation factor for the signal on the i-th path with a delay τ_i(t).
Substituting x_T(t) from (1.33) into (1.35) yields


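$$x_R(t) = \sqrt{2}\,\mathrm{Re}\left\{\left[\sum_i \alpha_i(t)\, s_l\big(t - \tau_i(t)\big)\, e^{-j2\pi f_c \tau_i(t)}\right] e^{j2\pi f_c t}\right\} \tag{1.36}$$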


Alternately,  we  can  substitute  the  expanded  version  of  xT(t)  from  (1.34)  into 
(1.35)  and  we  get 


$$x_R(t) = \sqrt{2}\,\mathrm{Re}\left\{\left[\sum_i \alpha_i(t) \sum_m A_m\, g\big(t - mT_b - \tau_i(t)\big)\, e^{-j2\pi f_c \tau_i(t)}\right] e^{j2\pi f_c t}\right\} \tag{1.37}$$


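To gain more insight, let us assume that we only focus on the first symbol. Accordingly,
(1.37) can be simplified by setting m = 0. The resulting signal x_R,m=0 is:

$$x_{R,m=0}(t) = \sqrt{2}\,\mathrm{Re}\left\{\left[\sum_i \alpha_i(t)\, e^{-j2\pi f_c \tau_i(t)}\, A_0\, g\big(t - \tau_i(t)\big)\right] e^{j2\pi f_c t}\right\} \tag{1.38}$$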


Let us start from (1.38). We observe that there are three potential problems in the
multipath fading channel:

1.  Referring to (1.38), x_R,m=0 is seen to consist of the sum of a number of time-variant
vectors (phasors) due to the multipath, each having phase φ_i(t) = 2π·fc·τ_i(t). Hence
a significant change in the amplitude of x_R,m=0 occurs as φ_i(t) sweeps through 360°,
because the carrier of the original signal and the carrier of the replica signal
pass from constructive to destructive interference and back. φ_i(t) changes by 360°
when τ_i(t) changes by 1/fc. Since 1/fc is a small number (in DECT, with a carrier near
1.9 GHz, it is about half a nanosecond),
φ_i(t) can change by 360° with a relatively small change in the separation distance
between the Rx and Tx antennas. Hence the amplitude of x_R,m=0 fluctuates rapidly
as separation changes, as stated qualitatively in subsection 1.7.3. This is known as
envelope fading and will be explained in more detail in subsection 1.9.2.

2.  If we compare (1.38) with (1.15), we note a similarity, even
though (1.15) is derived for a classical channel. In (1.15) the received signal
consists of the desired transmitted symbol, A_0, plus interference caused by
symbols transmitted before and after symbol period 0 (second term). In (1.38)
the received signal contains the desired transmitted symbol, A_0, modulated
by exp(j2π·fc·t) (the carrier), plus interference caused by delayed versions of this same
transmitted symbol, A_0, again modulated by exp(j2π·fc·t) (the carrier). Following
(1.15) we explained how this interference causes ISI in the received
baseband signal. We can apply the same argument and conclude that the
received passband PAM signal in (1.38) also suffers from ISI. We call this
frequency selective fading, which will be explained in more detail in subsection
1.10.1.
3.  If  the  carrier  frequency  in  (1.38)  changes  with  time  due  to  movement  of  the 
mobile,  this  will  cause  xR(t)'s  phase  to  change.  We  call  this  fast  fading,  and  it 
will  be  explained  in  more  detail  in  subsection  1.10.2. 
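
The sensitivity described in the first observation can be checked with a two-path version of the sum in (1.38). This is a sketch under assumed values: a 1.9 GHz carrier, path amplitudes 1.0 and 0.7, and delays short enough that the pulse term g(t - τ_i) can be taken as 1. The function name is hypothetical.

```python
import cmath
import math

C = 3.0e8  # speed of light in m/s, rounded

def phasor_sum(alphas, taus_s, fc_hz):
    """Complex sum over i of alpha_i * exp(-j*2*pi*fc*tau_i), as in (1.38),
    with the pulse term g(t - tau_i) taken as 1 (delays far below a symbol)."""
    return sum(a * cmath.exp(-2j * math.pi * fc_hz * tau)
               for a, tau in zip(alphas, taus_s))

fc = 1.9e9
period = 1.0 / fc
print(f"1/fc = {period * 1e9:.3f} ns, "
      f"equivalent to {C * period * 100:.1f} cm of extra path")

alphas = [1.0, 0.7]  # assumed path amplitudes
for frac in (0.0, 0.25, 0.5, 0.75, 1.0):
    taus = [0.0, frac * period]  # second path delayed by a fraction of 1/fc
    mag = abs(phasor_sum(alphas, taus, fc))
    print(f"extra delay = {frac:4.2f}/fc: |sum| = {mag:.3f}")
```

A change of half a carrier period in the excess delay, corresponding to a few centimetres of path, takes the amplitude from its maximum to its minimum.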


1.9.2    Envelope  Fading 

In  the  discussion  following  (1.38),  we  stated  that  the  received  passband  PAM  signal 
suffers  from  envelope  fading.  We  also  stated  in  subsection  1.7.3  that  there  is  a  large 
number  of  paths  involved  in  multipath  fading.  Hence  it  is  best  to  describe  the 
amplitude of the reflected signals along different paths with a probability distribution
function (pdf). In fact, such amplitude variation also shows up in the amplitude
variation of the resulting envelope. Hence we focus on the pdf that describes
the envelope's amplitude. It will be shown that the pdf of the amplitude of this
envelope  can  be  modeled  accurately  by  a  Rayleigh  distribution.  Therefore,  this  type 
of  fading  is  also  known  as  Rayleigh  fading.  Using  such  a  model,  fades  of  20  dB 
below  the  root  mean  square  (rms)  value  of  the  signal  envelope  occur  approximately 
1%  of  the  time. 
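
The 1% figure follows from the cumulative distribution of a Rayleigh envelope, P(r ≤ R) = 1 - exp(-(R/r_rms)²), which is a standard property of the Rayleigh distribution. A minimal sketch, with a hypothetical function name:

```python
import math

def rayleigh_fade_probability(depth_db):
    """Probability that a Rayleigh envelope is more than depth_db below its rms value.

    Uses P(r <= R) = 1 - exp(-(R/rms)**2) with (R/rms)**2 = 10**(-depth_db/10).
    """
    rho_sq = 10.0 ** (-depth_db / 10.0)
    return 1.0 - math.exp(-rho_sq)

for depth in (10, 20, 30):
    prob = rayleigh_fade_probability(depth)
    print(f"fade deeper than {depth:2d} dB below rms: {100.0 * prob:6.3f} % of the time")
```

For a depth of 20 dB the result is just under 1%, in agreement with the statement above.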


1.9.2.1     Rayleigh  Distribution 


To show that the amplitude of the envelope follows a Rayleigh distribution, we start
from (1.36), which is repeated here and expressed in terms of A_i(t):

$$x_R(t) = \sqrt{2}\,\mathrm{Re}\left\{\left[\sum_i A_i(t)\, e^{j\phi_i(t)}\right] e^{j2\pi f_c t}\right\} \tag{1.39}$$

Here A_i(t) represents the attenuated, delayed baseband signal and φ_i(t) is the
phase of the i-th path.

We assume that the mobile unit is stationary. Then the phase shift introduced by
the channel for the i-th path can be expressed as

$$\phi_i = 2\pi f_c \tau_i = 2\pi \frac{c}{\lambda} \cdot \frac{s_i}{c} = 2\pi \frac{s_i}{\lambda} \tag{1.40}$$

where λ is the wavelength of the carrier signal and s_i is the additional distance
traveled by the reflected wave of the i-th path with respect to the direct path.

Since the s_i are random, the φ_i are random phases. We further assume they are
uniformly distributed from 0 to 2π. Writing the complex exponents in (1.39) in
terms of their real and imaginary components, we get an expression of x_R(t) in terms
of the in-phase and the quadrature components:

$$x_R(t) = s_I(t)\cos(2\pi f_c t) - s_Q(t)\sin(2\pi f_c t) \tag{1.41}$$

where

$$s_I(t) = \sum_i A_i(t)\cos(\phi_i) \tag{1.42}$$

$$s_Q(t) = \sum_i A_i(t)\sin(\phi_i) \tag{1.43}$$

Notice that by the central limit theorem the baseband random processes s_I(t) and
s_Q(t) are approximately Gaussian, since each is the sum of a large number of path
contributions that are assumed to be independent and identically distributed. The
envelope of the received signal x_R(t) is given by

$$|x_R(t)|_{\mathrm{envelope}} = \sqrt{s_I(t)^2 + s_Q(t)^2} \tag{1.44}$$

and, because the envelope of two independent zero-mean Gaussian components of equal
variance is Rayleigh distributed, its amplitude's pdf is a Rayleigh distribution.
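
The argument can be checked with a small Monte Carlo experiment. The sketch below is an illustration under simplifying assumptions: all paths have equal amplitude, the phases are independent and uniform on [0, 2π), and the amplitudes are scaled so that the mean-square envelope is 1. The function names, the number of paths, and the seed are arbitrary choices.

```python
import math
import random

def envelope_sample(n_paths, rng):
    """One draw of sqrt(s_I**2 + s_Q**2) for n_paths equal-amplitude paths
    with independent phases uniform on [0, 2*pi)."""
    amp = 1.0 / math.sqrt(n_paths)  # makes the mean-square envelope equal to 1
    s_i = s_q = 0.0
    for _ in range(n_paths):
        phi = rng.uniform(0.0, 2.0 * math.pi)
        s_i += amp * math.cos(phi)
        s_q += amp * math.sin(phi)
    return math.hypot(s_i, s_q)

def simulate(n_paths=20, n_draws=50_000, seed=1):
    """Return (mean-square envelope, fraction of draws more than 10 dB below rms)."""
    rng = random.Random(seed)
    draws = [envelope_sample(n_paths, rng) for _ in range(n_draws)]
    mean_square = sum(r * r for r in draws) / n_draws
    frac_below_10db = sum(r * r < 0.1 for r in draws) / n_draws
    return mean_square, frac_below_10db

mean_square, frac = simulate()
rayleigh_prediction = 1.0 - math.exp(-0.1)
print(f"mean-square envelope        : {mean_square:.3f}")
print(f"fraction 10 dB below rms    : {frac:.4f}")
print(f"Rayleigh prediction (10 dB) : {rayleigh_prediction:.4f}")
```

With a few tens of paths the simulated fraction of fades already lies close to the Rayleigh prediction, which is what the central limit argument suggests.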


1.9.2.2     Increased  SNR  Requirement 

In this subsection we rederive the relationship between SNR and BER that was first
derived in subsection 1.6.1 for a classical channel, except this time with multipath
fading the received signal's amplitude is no longer constant but is given by a
distribution. We show how this amplitude fluctuation, due to envelope fading, increases the
SNR required at the demodulator input. For envelope fading, from (1.41) through
(1.44), we are again interested in the baseband's amplitude variation. Hence we can start
from (1.36) and eliminate the exp(j2π·fc·t) term to obtain the baseband term. To simplify
matters, we just develop envelope fading on a flat fading channel [4]. Envelope fading
on frequency selective channels is difficult to calculate and requires computer
simulation. Since ISI no longer exists, (1.36) can be simplified without the exp(j2π·fc·t) term
and becomes

$$x_{R,\mathrm{baseband}}(t) = \alpha(t)\, s_l(t)\exp(-j\theta(t)) \tag{1.45}$$

Here α(t) is the gain of the channel and θ(t) is the phase shift of the channel.
Observing (1.45), we can see that multipath fading manifests itself as a
multiplicative (gain) variation in the transmitted signal s_l(t).

If we incorporate AWGN in the channel, we have

$$x_{R,\mathrm{baseband}}(t) = \alpha(t)\, s_l(t)\exp(-j\theta(t)) + n(t) \tag{1.46}$$

where n(t) is the AWGN of the channel.

To evaluate the probability of error for a modulation scheme, we can follow the
same procedure as in subsection 1.6.1, except that we have to recognize that the A
term in (1.5) is no longer a constant. To overcome this we can use the concept of
conditional probability. The probability of making an error while the amplitude
assumes a specific value, A_specific, is the joint probability
of two subevents: (1) the event that the amplitude assumes that
specific value, A_specific, and (2) the event that, given this amplitude, either
A_specific - n(Tb) becomes less than zero, or -A_specific + n(Tb) becomes larger than zero.
The probability of subevent (2) obviously depends on the value of A_specific and the
standard deviation of n(Tb). Since the fading and the noise are assumed to be independent,
the joint probability is just the product of the probabilities of the two subevents. The
total probability, Pe, is now the integral of these joint probabilities over all amplitude
values. Mathematically it is written as


$$P_e = \int_0^{\infty} p(X)\, P_e(X)\, dX \tag{1.47}$$

Here p(X) is the pdf of X due to fading, which is the probability of subevent (1);
Pe(X) is the probability of error at a specific SNR = X, where X is given by X = α²·Eb/N0;
α is the gain, as given in (1.45); and Eb and N0 have been defined in subsection 1.6.1.
Hence Pe(X) can be interpreted as the probability of subevent (2).

Notice  that  the  variable  a  captures  the  amplitude  fluctuation  due  to  fading, 
normalized  with  respect  to  Eh/N0.  For  most  flat  fading,  the  probability  distribution 
of  the  amplitude  follows  a  Rayleigh  distribution,  as  described  in  subsection  1.9.2.1. 
Therefore,  a  is  given  by  a  Rayleigh  distribution.  Hence  a2  and  therefore  X  follows  a 
chi-square  distribution  given  by 

$$p(X) = \frac{1}{\Gamma}\exp\left(-\frac{X}{\Gamma}\right) \tag{1.48}$$

Here

$$\Gamma = \frac{E_b}{N_0}\,\overline{\alpha^2} \tag{1.49}$$

and can physically be interpreted as the average SNR.
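
The link between a Rayleigh-distributed gain and an exponentially distributed SNR can be checked with a short simulation. The following is a minimal sketch, not part of the original derivation: the function name, the sample count, the value of the average SNR, and the normalization of the mean-square gain to 1 are all assumptions made for illustration.

```python
import math
import random

def simulate_snr_samples(avg_snr, num_samples, seed=0):
    """Draw instantaneous SNR values X = alpha^2 * Eb/N0 for a Rayleigh gain alpha.

    Assumption: the mean-square gain E[alpha^2] is normalized to 1, so that
    avg_snr plays the role of the average SNR.
    """
    rng = random.Random(seed)
    sigma = math.sqrt(0.5)  # per-component std dev giving E[alpha^2] = 1
    samples = []
    for _ in range(num_samples):
        # A Rayleigh amplitude is the magnitude of two independent
        # zero-mean Gaussian components (in-phase and quadrature).
        alpha = math.hypot(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
        samples.append(alpha * alpha * avg_snr)
    return samples

GAMMA = 100.0  # average SNR (linear); arbitrary illustrative value
snr_samples = simulate_snr_samples(GAMMA, 100_000)
sample_mean = sum(snr_samples) / len(snr_samples)
# For an exponential pdf with mean GAMMA, P(X < GAMMA) = 1 - exp(-1).
fraction_below_mean = sum(x < GAMMA for x in snr_samples) / len(snr_samples)

print(f"sample mean of X: {sample_mean:.1f} (average SNR {GAMMA})")
print(f"P(X < average SNR): {fraction_below_mean:.3f} (theory {1 - math.exp(-1):.3f})")
```

The sample mean should land close to the average SNR, and roughly 63% of the instantaneous SNR values fall below it, which is why fading demands so much extra margin.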

Of course, $P_e(X)$ is a function of the modulation scheme. As an example, let us
calculate the total probability of error corresponding to GMSK, $P_{e,GMSK}$. We apply (1.48)
and the $P_e(X)$ corresponding to GMSK to (1.47) and, for large average SNR, we get

$$P_{e,GMSK} \approx \frac{1}{4\delta\Gamma} \tag{1.50}$$

where $\delta$ is a variable that depends on $BT_b$. Here $B$ is the 99% bandwidth of the
Gaussian pulse and is defined to be the bandwidth where 99% of the pulse energy is
contained. $T_b$ is the symbol period. For example, $\delta = 0.68$ for $BT_b = 0.25$ and
$\delta = 0.84$ for $BT_b = \infty$.

Numerical Example 1.3

From Table 1.1, DECT uses GMSK with $BT_b = 0.3$. We can take that to be close
enough to 0.25 and set $\delta = 0.68$. Again from Table 1.1, BER ($= P_e$) for DECT $= 10^{-3}$.

Substituting these into (1.50), we have $10^{-3} = \frac{1}{4 \times 0.68 \times \mathrm{SNR}}$.

Solving, SNR = 367, or

$$\mathrm{SNR} \approx 25\ \mathrm{dB} \tag{1.51}$$

Comparing (1.51) with (1.26), notice that the present SNR is 16 dB higher than
that required for AWGN alone.
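
The arithmetic of Numerical Example 1.3 can be reproduced in a few lines. This is a minimal sketch that assumes the relation reads $P_e = 1/(4\,\delta\,\mathrm{SNR})$, which is the form that reproduces the SNR of 367 quoted above; the function name is hypothetical.

```python
import math

def required_snr_gmsk_rayleigh(target_ber, delta):
    """Average SNR (linear) from the assumed relation Pe = 1 / (4 * delta * SNR)."""
    return 1.0 / (4.0 * delta * target_ber)

# Values taken from Numerical Example 1.3: BER = 1e-3, delta = 0.68.
snr_linear = required_snr_gmsk_rayleigh(target_ber=1e-3, delta=0.68)
snr_db = 10.0 * math.log10(snr_linear)

print(f"required SNR: {snr_linear:.1f} (linear), {snr_db:.1f} dB")
```

The linear value comes out near 367 and the decibel value between 25 and 26 dB, in line with (1.51).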

For  other  modulation  schemes,  similar  equations  have  been  derived  [4].  Some  of 
these  plots  are  shown  in  Figure  1.26. 


1.9.2.3     Frequency  and  Space  Diversity 

The SNR derived previously to combat amplitude fluctuation due to envelope
fading is rather large. Instead of using such a large SNR, we can achieve the
same BER by using a diversity technique. There are basically two diversity
techniques that improve the quality of the received signal (degraded due to the
envelope's amplitude fluctuation) and hence help reduce the required SNR:

1.  The frequency diversity approach. Frequency diversity simply involves
modulating two different RF carrier waves with the same baseband signal and
then transmitting both RF signals to the same receiver. If the two carrier waves
use two different frequencies and if they are separated by more than $(\Delta f)_c$, then
the two information-bearing signals are affected differently by the channel.
Fig. 1.26  BER as a function of SNR (dB) under envelope fading, for GMSK and BPSK

The coherence bandwidth, $(\Delta f)_c$, will be defined in subsection 1.10.1.4. At the
receiver, both RF signals are demodulated, and the one that yields the better-
quality baseband signal is selected.

2.  The space diversity approach. In this approach, more than one antenna at the
receiver provides the input signal. It is important that the
antennas are separated by multiple wavelengths of the carrier signal. This is to
ensure that two or more signals of the same frequency, which are attenuated in the
same way by the channel, are in phase and additive. If received out of phase, they
will cancel and consequently result in less received signal power than if a single
antenna were used. Note that both diversity techniques can be combined.

3.  OFDM is an example of applying a diversity technique.


1.10    Multipath  Fading:  Frequency  Selective  and  Fast  Fading 

Now  that  we  have  presented  how  multipath  fading  reduces  amplitude,  we  will  show 
how  it  also  introduces  ISI.  In  addition,  we  discuss  how  it  can  reduce  amplitude  in  a 
time-varying  fashion. 


1.10.1     Frequency  Selective  Fading 


We  stated  in  the  discussion  following  (1.38)  that  under  multipath  fading  the  received 
passband  PAM  signal  suffers  from  ISI.  To  see  how  ISI  is  created,  we  need  to  look  at 
the  baseband  component  of  this  passband  PAM  signal.  We  could  have  gone  to  (1.37) 
and  started  to  extract  the  baseband  component  from  the  passband  PAM  signal 
directly. This is mathematically complex. Traditionally this complex mathematical
problem is handled using tools such as correlation functions, which may be less familiar
to circuit designers (refer to [1] for such a treatment). Alternatively (which is the
approach adopted in this chapter), we can start from the channel model described
in Section 1.9 [(1.34) to (1.38)], but we take the carrier term out. The resulting
equations are as follows:

$$x_T(t) = s_l(t) \tag{1.52}$$

$$x_R(t) = \sum_i \alpha_i(t)\,x_T(t - \tau_i(t)) \tag{1.53}$$

$$x_R(t) = \sum_i \alpha_i(t)\,s_l(t - \tau_i(t)) \tag{1.54}$$

$$x_R(t) = \sum_i \alpha_i(t) \sum_m A_m\,g(t - mT_b - \tau_i(t)) \tag{1.55}$$

$$x_{R,m=0}(t) = \sum_i \alpha_i(t) \cdot e^{(\cdot)}\,A_0\,g(t - \tau_i(t)) \tag{1.56}$$

Hence,  for  example,  comparing  (1.55)  to  (1.37),  the  delay  due  to  the  channel 
now  is  applied  to  the  baseband  signal  (symbol)  only,  rather  than  to  the  carrier  and 
the  baseband  signal  (symbol).  Of  course  this  is  only  an  approximation.  However, 
the  advantage  of  this  model  is  that  now  in  order  to  observe  the  effect  of  the  channel, 
we  need  to  apply  attenuation  and  delay  associated  with  different  paths  to  the 
baseband  signal  (symbol)  only.  Hence  we  can  concentrate  our  discussion  only  on 
the  baseband  signal.  With  this  in  mind,  let  us  examine  an  example  to  illustrate  how 
ISI  is  caused  by  multipath  fading. 

1.10.1.1     An  Illustrative  Example 

In  this  example  we  make  the  following  assumptions: 

1.  $g(t)$, the baseband pulse used in the channel [refer to (1.55) and (1.56)], is
implemented using a sinc pulse.

2.  We decide to send 2 bits: a logic 0 followed by a logic 1. Hence in (1.55),
$A_0 = -1$ and $A_1 = 1$.

3.  To simplify matters, we assume $\alpha_1 = 1$ and $\tau_1 = 0$; that is, path 1, the direct path,
suffers no attenuation and no time delay.

4.  $T_b$, the symbol period, is set to 2 ms.

First, let us investigate a channel [represented by $\alpha_i$, $\tau_i$ in (1.53)] that consists of
three paths (i.e., $i = 1, 2, 3$), shown in Figure 1.27. It consists of one direct path and
two reflected paths, which have specific $\alpha_i$, $\tau_i$.

To  see  how  ISI  is  created  in  this  channel  and  also  how  severe  this  ISI  is,  let  us 
look  at  the  following  two  cases: 

1.  Focus on path 1 and path 3: Set $\tau_3 = 0.01$ ms, $\alpha_3 = 0.8$. Therefore, attenuation and
delay of path 3 are rather small. The resulting pulses are shown in Figure 1.28a.
Fig. 1.27  Illustration of a channel where there exists a path (path 2) with both $\alpha$ and $\tau$ large


These pulses are obtained by going through the following reasoning: the $A_1$ pulse
starts at symbol time of 1 ms with a peak value of 1, goes through path 1, suffers
no attenuation and no delay, and arrives at the Rx antenna at 1 ms. Meanwhile,
this same $A_1$ pulse, which starts at 1 ms with a peak value of 1, also goes through
path 3, gets attenuated, and results in a pulse having a peak value of 0.8.
Furthermore, it gets delayed by 0.01 ms and arrives at the Rx antenna at 1.01
ms. Hence at about 1 ms, the Rx antenna receives two pulses, which add to one
another. ISI is not severe.

2.  Focus on path 1 and path 2: We set $\tau_2 = 2$ ms ($= T_b$), $\alpha_2 = 0.8$. We want
to see what happens to the pulses when a signal goes through such a path, which
suffers little attenuation even with a long delay. The resulting pulses are
shown in Figure 1.28b. These pulses are obtained by going through the following
reasoning: the $A_0$ pulse starts at -1 ms with a peak value of -1, goes
through path 2, gets attenuated to a peak value of -0.8, gets delayed by
2 ms, and arrives at the Rx antenna at 1 ms. Meanwhile, the $A_1$ pulse, which
starts at the next symbol time of 1 ms and has a peak value of 1, goes through
path 1, suffers no attenuation and no delay, and arrives at the Rx antenna at 1
ms. Hence at 1 ms, the Rx antenna receives two pulses, which largely cancel one
another. ISI is severe. (Note: the $A_0$ pulse, which starts at -1 ms with a peak
value of -1, also goes through path 1, suffers no attenuation and no delay, and
arrives at the Rx antenna at -1 ms. As shown, this pulse still has some residual
energy around 1 ms. However, because this is a sinc pulse, at exactly 1 ms this
pulse crosses zero and therefore does not interfere with the $A_1$ pulse that has
gone through path 1.)

The  preceding  conclusions  on  how  severe  ISI  is  can  also  be  seen  by  plotting  the 
eye  diagram  for  the  two  different  cases,  shown  in  Figure  1.28c. 

Basically, the preceding discussion highlights the fact that for this channel, where
there exist paths (path 2 in Figure 1.27) that exhibit large path delays $\tau_i$ and large
attenuation factors $\alpha_i$ (that is, strong reflections), strong ISI will occur.
Fig. 1.28a, b  (a) Baseband PAM signal after multipath reception, where ISI does not occur, since
reflected signal is not delayed, (b) Baseband PAM signal after multipath reception, where ISI
occurs, since the reflected signal is delayed and is strong
Fig. 1.28c  Simplified eye diagram for comparisons on paths 1, 3 and paths 1, 2 for the channels
described in Figure 1.27. (I) No ISI when the delay of the reflected path is small compared to the
symbol interval $T_b$ or when attenuation is large. (II) Severe ISI when the delay of the reflected path
is approximately half the symbol interval $T_b$ while attenuation is small


Now let us investigate those channels whose paths exhibit large path delays $\tau_i$
but small attenuation factors $\alpha_i$. We will show that ISI is not severe for these
channels. As an example of such a channel, let us modify the channel described
in Figure 1.27 by changing the $\alpha_2$ associated with path 2. $\tau_2$, ($\alpha_1$, $\tau_1$), and ($\alpha_3$, $\tau_3$) remain
the same. The resulting channel is shown in Figure 1.29. To see how severe ISI is
with this channel, let us do the following:

1.  Focus  on  path  1  and  path  3.  Since  everything  is  the  same  as  in  Figure  1.27,  ISI  is 
not  severe. 

2.  Focus on path 1 and path 2: we set $\tau_2 = 2$ ms ($= T_b$), $\alpha_2 = 0.08$. Notice that the
attenuation for path 2 is severe. The resulting pulses are shown in Figure 1.30.
These pulses are obtained by going through the following reasoning: the $A_0$ pulse
Fig. 1.29  Channel is modified from that in Figure 1.27. Specifically, path 2 is modified so that $\alpha$ is
small
Fig. 1.30  Baseband PAM signal after multipath reception for channel in Figure 1.29. ISI does not
occur since reflected signal is weak


starts at -1 ms with a peak value of -1, goes through path 2, gets attenuated to a
peak value of -0.08, gets delayed by 2 ms, and arrives at the Rx antenna at 1 ms.
Meanwhile, the $A_1$ pulse starts at the next symbol time of 1 ms with a peak value
of 1, goes through path 1, suffers no attenuation and no delay, and arrives at the
Rx antenna at 1 ms. Hence at 1 ms, the Rx antenna receives two pulses, which
hardly cancel one another. ISI is not severe.
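
The three cases discussed so far (paths 1 and 3; paths 1 and 2 with a strong reflection; paths 1 and 2 with a weak reflection) can be checked numerically by summing the delayed, attenuated sinc pulses at the sampling instant of 1 ms. The sketch below is illustrative only: it assumes pulse amplitudes of -1 (launched at -1 ms) and +1 (launched at 1 ms), a sinc pulse with zero crossings every symbol period of 2 ms, and hypothetical function and variable names.

```python
import math

T_B_MS = 2.0  # symbol period in ms
# (amplitude, launch time in ms): the logic-0 pulse at -1 ms, the logic-1 pulse at +1 ms
SYMBOLS = [(-1.0, -1.0), (1.0, 1.0)]

def sinc(x):
    """Normalized sinc, sin(pi x) / (pi x), with zero crossings at nonzero integers."""
    return 1.0 if x == 0.0 else math.sin(math.pi * x) / (math.pi * x)

def received_sample(t_ms, paths):
    """Sum of all pulses over all paths at time t_ms; paths is a list of (gain, delay in ms)."""
    total = 0.0
    for gain, delay_ms in paths:
        for amplitude, launch_ms in SYMBOLS:
            total += gain * amplitude * sinc((t_ms - launch_ms - delay_ms) / T_B_MS)
    return total

direct = (1.0, 0.0)
cases = {
    "paths 1 and 3 (gain 0.8, delay 0.01 ms)": [direct, (0.8, 0.01)],
    "paths 1 and 2 (gain 0.8, delay 2 ms)": [direct, (0.8, 2.0)],
    "paths 1 and 2 (gain 0.08, delay 2 ms)": [direct, (0.08, 2.0)],
}
samples_at_1ms = {name: received_sample(1.0, paths) for name, paths in cases.items()}

for name, value in samples_at_1ms.items():
    print(f"{name}: sample at 1 ms = {value:.3f} (ideal value 1.000)")
```

With the strong, long-delayed reflection the sample collapses to about 0.2 of its ideal value, whereas the short-delay and the weak-reflection cases leave it well above the decision threshold of zero.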

Hence the channel described in Figure 1.27 exhibits severe ISI while the channel
described in Figure 1.29 does not. We would now want to generalize this to a typical
channel that contains many more paths. We would conclude that for a channel that
has a substantial portion of its paths exhibiting large path delays $\tau_i$ accompanied by
large attenuation factors $\alpha_i$, strong ISI will occur.
1.10.1.2     Criteria of a Flat/Frequency Selective Channel

We can elaborate the preceding conclusions further. If all the reflected signals that
arrive have their amplitude attenuated at a rate that is fast enough that they do not
cause ISI, then this is a good channel. This rate is crudely defined as follows: the
amplitude must be attenuated to below a certain threshold when the delay becomes
comparable to the symbol period $T_b$. A channel that has this characteristic is called a
flat channel. Hence the channel described in Figure 1.29 is a flat channel. Otherwise
it is a frequency selective channel that suffers from frequency selective fading.
The channel described in Figure 1.27 is a frequency selective channel.

The preceding condition can be restated as the following condition: delay for
paths possessing strong reflection must be much smaller than $T_b$. As stated in
subsection 1.7.3, the characteristic of multipath fading is that a large number of
paths are involved. Hence it is best to describe their delay and amplitude with a
probability distribution function. We will then plot amplitude of the paths versus
their respective delay and define a parameter, $\tau_m$, called time delay spread.
The condition for having a flat channel using this parameter then becomes

$$\tau_m \ll T_b \tag{1.57}$$


1.10.1.3     Time  Delay  Spread 

We now discuss $\tau_m$, the time delay spread. First let us plot the pdf of $\tau_i$ for a typical
channel (Figure 1.31).

For the channel described in Figure 1.27 we have $\tau_1 = 0$ (path 1), $\tau_3 = 0.01$ ms (path 3),
and $\tau_2 = 2$ ms (path 2). Next we want to plot the amplitude factor $\alpha_i$ versus the delay
factor $\tau_i$. We can give the following first-order, qualitative observations that relate the
two factors.

1.  We assume that signal attenuation comes mainly from absorption of signal
power that occurs during reflections. Hence we can observe that for a given
path, $\alpha_i$ decreases as the number of reflections increases.
Fig. 1.32  Attenuation factor of paths plotted against delays of paths; $\tau_m$ is the parameter that specifies the delay of the paths beyond which the attenuation factor is below a certain threshold


2.  What is the relationship between the number of reflections and the delay
experienced by the signal travelling along a particular path? We can assume,
in general, that a signal travelling along a longer path is more likely to encounter
more reflections. Meanwhile, this longer path usually means its associated delay,
$\tau_i$, is larger. If we put the two together, we observe that the number of reflections
increases when $\tau_i$ increases.

If we apply observation (2) to observation (1), then $\alpha_i$ decreases as $\tau_i$ increases.
This result is plotted in Figure 1.32, where we also define a threshold for $\alpha_i$.
This threshold is defined such that for a path with $\alpha_i$ below its value, a signal
travelling through this path does not cause any ISI. Corresponding to this threshold,
we define the parameter $\tau_m$. This $\tau_m$ can then be interpreted as follows: a signal that
travels through any path with delay larger than $\tau_m$ does not cause any ISI.

The rate at which $\alpha_i$ decreases as $\tau_i$ increases depends on the channel.
For example, the channel described in Figure 1.27 has a relatively flat plot, resulting
in a large $\tau_m$. On the contrary, the channel described in Figure 1.29 would be
described with a plot having a large negative slope, resulting in a small $\tau_m$.

How do we use Figure 1.32? As illustrated in the last paragraph, for every channel
there is a plot that is similar to Figure 1.32, but with a different $\tau_m$. From these plots
the $\tau_m$ of these channels can be determined. We then go to condition (1.57) and
decide if the channel is flat.


1.10.1.4     Coherence  Bandwidth 


Remember  that  in  subsection  1.6.2  we  discussed  how  a  classical  channel  with  finite 
bandwidth  introduces  ISI.  In  the  discussion  of  Figure  1.27  we  showed  that  a 
multipath  fading  channel  can  also  introduce  ISI.  We  would  therefore  like  to 
associate  the  concept  of  finite  bandwidth  to  a  multipath  fading  channel  as  well. 
To  do  this  we  revisit  the  concept  of  time-varying  channel  model  (introduced  in 
section  1.9.1)  that  is  used  to  describe  a  multi-path  fading  channel.  We  will  show 
that  the  channel  model  allows  us  to  interpret  the  fading  channel  as  a  filter,  and 
therefore the concept of bandwidth can be applied to the fading channel. We further
show that we can relate the bandwidth of the fading channel to $\tau_m$.

First let us repeat (1.36):

$$x_R(t) = \sqrt{2}\,\mathrm{Re}\left\{\left[\sum_i \alpha_i(t)\,s_l(t - \tau_i(t))\,e^{-j2\pi f_c \tau_i(t)}\right] e^{j2\pi f_c t}\right\} \tag{1.58}$$


Now let us reinterpret this equation. From (1.58), $x_R(t)$ is seen to be a sum of
different reflected signals, each with its own delay $\tau_i$ and attenuation factor $\alpha_i$.
These are the delays and attenuation introduced by the paths of the particular
channel. For example, for the channel described in Figure 1.27, we have $\tau_1 = 0$,
$\tau_2 = 2$ ms, $\tau_3 = 0.01$ ms, $\alpha_1 = 1$, $\alpha_2 = 0.8$, $\alpha_3 = 0.8$; and for the channel described in
Figure 1.29, we have $\tau_1 = 0$, $\tau_2 = 2$ ms, $\tau_3 = 0.01$ ms, $\alpha_1 = 1$, $\alpha_2 = 0.08$, $\alpha_3 = 0.8$.
We first assume that these parameters are time invariant. Then (1.58) describes
an input-output relationship similar to that of a time-invariant finite impulse
response (FIR) filter. To see this more clearly, let us remove the carrier term from
(1.58) and make $\alpha_i$ and $\tau_i$ time invariant. Hence we have

$$x_R(t) = \sum_i \alpha_i\,s_l(t - \tau_i) \tag{1.59}$$

We can now identify in (1.59) the input as $s_l(t - \tau_i)$ and the output as $x_R(t)$. If we further
make the delay increment between successive $\tau_i$ the same for each $i$ and identify it as the
sample period, then (1.59) describes the input-output relationship of an FIR filter. The tap
weights of the FIR filter are the $\alpha_i$, and the length of the filter is the length of the sum.
As with any filter, this filter can also be described by a transfer function and has a
frequency response. If we reinsert the carrier term, we are essentially shifting the
frequency response of this filter so that its center frequency is around $f_c$. For simplicity,
we ignore the effect of the carrier in the following discussion and deal directly with the
baseband signal.
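
To make the filter interpretation concrete, the frequency response of a time-invariant multipath channel can be evaluated as a sum of weighted delays. The following is a rough sketch, not the author's procedure: it keeps only the direct path and path 2 from the illustrative example (2 ms delay, gain 0.8 or 0.08), and the function name and the probe frequencies are assumptions.

```python
import cmath

def channel_response(freq_hz, paths):
    """H(f) = sum_i gain_i * exp(-j 2 pi f delay_i) for paths given as (gain, delay in s)."""
    return sum(gain * cmath.exp(-2j * cmath.pi * freq_hz * delay) for gain, delay in paths)

strong_echo = [(1.0, 0.0), (0.8, 2e-3)]   # direct path plus a strong 2 ms reflection
weak_echo = [(1.0, 0.0), (0.08, 2e-3)]    # direct path plus a weak 2 ms reflection

probe_frequencies_hz = (0.0, 125.0, 250.0)
strong_gains = [abs(channel_response(f, strong_echo)) for f in probe_frequencies_hz]
weak_gains = [abs(channel_response(f, weak_echo)) for f in probe_frequencies_hz]

for f, gs, gw in zip(probe_frequencies_hz, strong_gains, weak_gains):
    print(f"f = {f:6.1f} Hz: |H| strong echo = {gs:.3f}, weak echo = {gw:.3f}")
```

With the strong reflection the gain swings between 1.8 and 0.2 within 250 Hz, so the channel is strongly frequency dependent; with the weak reflection it stays between 1.08 and 0.92, which is close to flat.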

Next let us assume that this filter has a certain bandwidth. To interpret this
bandwidth, like the bandwidth of an ordinary filter, we will go through the
following argument. Let us send a baseband signal whose symbol frequency,
$f_b$ ($= 1/T_b$), is much larger than $1/\tau_m$. Then $T_b$ is much smaller than $\tau_m$, and
condition (1.57) is violated, resulting in severe ISI. In other words, the baseband
signal cannot be reproduced. If we treat the fading channel as a filter, we can
roughly interpret this as saying that the baseband signal does not go through the
filter. Next, we send another baseband signal whose symbol frequency, $f_b$ ($= 1/T_b$), is
much smaller than $1/\tau_m$. Then $T_b$ is much larger than $\tau_m$, and condition (1.57) is
satisfied, resulting in little ISI. Again, if we treat the fading channel as a filter, we
can roughly interpret this as saying that the baseband signal passes. Hence the
channel behaves like a low-pass filter with a filter bandwidth of $1/\tau_m$ (band-pass
filter if we reintroduce the carrier). In this context we can define the bandwidth of
the channel, denoted as coherence bandwidth $(\Delta f)_c$, as

$$(\Delta f)_c = \frac{1}{\tau_m} \tag{1.60}$$
Fig. 1.33  (a) Flat fading, (b) frequency selective fading

In  essence,  the  coherence  bandwidth  indicates  the  bandwidth  over  which  the 
signal  undergoes  practically  the  same  shifts  and  attenuation  during  transmission 
over  a  multipath  channel. 

Condition (1.57) can now be restated in terms of the coherence bandwidth as

$$f_b \ll (\Delta f)_c \tag{1.61}$$

For example, we can take $f_b$ to be 1.72 MHz in DECT.

Using  this  new  condition,  a  fading  channel  that  satisfies  (1.61)  is  called  a  flat 
fading  channel  and  is  shown  in  Figure  1.33  (a).  A  fading  channel  that  violates  this  is 
referred  to  as  a  frequency  selective  channel  and  is  shown  in  Figure  1.33  (b).  In  Figure 
1.33  we  have  reintroduced  the  carrier  term  and  so  the  channel  is  a  band-pass  filter. 

Finally, let us revisit the assumption of making $\alpha_i$ and $\tau_i$ time invariant. If this is
not true, then the channel becomes a time-varying channel represented by a time-
varying filter, but the concept of coherence bandwidth remains valid.


1.10.1.5    ISI  Degradation 

The ISI caused by frequency selective fading will degrade the BER. Normally, this
can be compensated by increasing the SNR. However, ISI can become so severe that we
reach a situation whereby the BER remains flat no matter how much we increase the
SNR. This BER is called the irreducible BER. This irreducible BER is a function of
how frequency selective the channel is, which in turn is quantified by the ratio
$d = \tau_m/T_b$, called the normalized delay spread. A plot of this irreducible BER versus the
normalized delay spread is shown in Figure 1.34, for two modulation schemes. Let us
now examine an example and see how to use Figure 1.34 to find out, for a given
wireless channel, whether a given standard is satisfied.
Fig. 1.34  Irreducible BER as a function of normalized delay spread for different modulation
schemes, frequency selective fading

Numerical  Example  1.4 

For DECT, from Table 1.1 we have a bandwidth of 1.7 MHz, which gives a symbol
period of 0.6 µs. Measurement on a given indoor channel shows a delay $\tau_m = 100$ ns,
hence $d = 100$ ns / 0.6 µs, or about 0.17. From Figure 1.34, this gives an irreducible BER
that exceeds the $10^{-3}$ required of DECT in Table 1.1.
Hence this does not satisfy the DECT specification in Table 1.1.
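
The quantities used in Numerical Example 1.4 follow directly from the definitions of the normalized delay spread and the coherence bandwidth. The sketch below is a small illustration with hypothetical function names; it uses the unrounded symbol period 1/(1.7 MHz) instead of 0.6 µs, so the normalized delay spread comes out as 0.17.

```python
def normalized_delay_spread(delay_spread_s, symbol_period_s):
    """d = tau_m / T_b."""
    return delay_spread_s / symbol_period_s

def coherence_bandwidth_hz(delay_spread_s):
    """Coherence bandwidth taken as the reciprocal of the delay spread."""
    return 1.0 / delay_spread_s

symbol_rate_hz = 1.7e6    # DECT figure quoted in the example
delay_spread_s = 100e-9   # measured indoor delay spread from the example

symbol_period_s = 1.0 / symbol_rate_hz
d = normalized_delay_spread(delay_spread_s, symbol_period_s)
bandwidth_hz = coherence_bandwidth_hz(delay_spread_s)
rate_to_bandwidth = symbol_rate_hz / bandwidth_hz

print(f"symbol period: {symbol_period_s * 1e6:.2f} us")
print(f"normalized delay spread d: {d:.2f}")
print(f"coherence bandwidth: {bandwidth_hz / 1e6:.1f} MHz")
print(f"symbol rate / coherence bandwidth: {rate_to_bandwidth:.2f}")
```

The symbol rate is only about six times smaller than the coherence bandwidth, so the "much smaller than" condition for a flat channel is met only marginally, which is consistent with the example's conclusion.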


1.10.1.6     Equalization 

Numerical Example 1.4 shows that the ISI is so severe that the BER requirement
cannot be met. To combat ISI, an equalization technique can be adopted. Crudely
speaking, equalization can be thought of as compensating for the amplitude and
phase distortion of the channel with a filter whose transfer function is the inverse of
the channel's transfer function. This filter is called an equalizer. If the channel is
time varying, the equalizer has to be adaptive as well. This can be achieved by first
sending a known training sequence to the receiver, so that the equalizer converges to a
proper setting. Then when the actual signal is sent, the equalizer adopts a recursive
algorithm to evaluate the channel's time-varying transfer function and adapts itself
accordingly. Equalizers can be broadly classified as linear (FIR, lattice, etc.) or
nonlinear (decision feedback equalizer, maximum likelihood sequence estimation
equalizer, etc.). A complete treatment can be found in [4]. We assume that one such
equalizer is applied to the receiver used in Numerical Example 1.4 to help restore the
BER to the acceptable level.
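
As one concrete illustration of training an adaptive linear equalizer, the sketch below uses the least-mean-squares (LMS) update on an FIR equalizer. LMS is chosen here only as a familiar example of a recursive algorithm; the text does not say which algorithm is used. The two-tap channel (echo gain 0.4), the tap count, the step size, the training length, and all names are assumptions, and noise is omitted.

```python
import random

def lms_train(received, training, num_taps=8, step=0.02):
    """Adapt FIR equalizer taps so the equalizer output tracks the known training symbols."""
    weights = [0.0] * num_taps
    delay_line = [0.0] * num_taps
    errors = []
    for sample, desired in zip(received, training):
        delay_line = [sample] + delay_line[:-1]          # shift in the newest sample
        output = sum(w * x for w, x in zip(weights, delay_line))
        error = desired - output                         # known symbol minus estimate
        weights = [w + step * error * x for w, x in zip(weights, delay_line)]
        errors.append(error)
    return weights, errors

rng = random.Random(1)
training = [rng.choice((-1.0, 1.0)) for _ in range(3000)]  # known training sequence
echo_gain = 0.4                                            # hypothetical one-symbol echo
received = [
    symbol + echo_gain * (training[k - 1] if k > 0 else 0.0)
    for k, symbol in enumerate(training)
]

weights, errors = lms_train(received, training)
final_mse = sum(e * e for e in errors[-500:]) / 500

print("first three taps:", [round(w, 3) for w in weights[:3]])
print(f"mean squared error over the last 500 symbols: {final_mse:.2e}")
```

The taps settle near 1, -0.4, 0.16, which are the leading terms of the inverse of the assumed channel, matching the description of the equalizer as an inverse filter.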
Fig. 1.35  OFDM QPSK on multi carriers

1.10.1.7     Diversity  and  OFDM 

As data rate increases in 3G and 4G, the rate of equalization has to be increased
accordingly, and may become impossible to implement under a given VLSI technology.
To overcome this limitation, diversity, as mentioned in section 1.9.2.3, can
be adopted. Let us illustrate this with QPSK modulation, as shown in Figure 1.35.
As explained in section 1.5.3, for a QPSK signal, a dibit (00, 01, 10, 11) is encoded
as one dot in the 4 quadrants of a circle, as shown in the top part of Figure 1.35.
Here the circle is represented in a 2-dimensional plot. The next part shows how
transmission is represented, using a 3-dimensional plot. The 2-dimensional circle is
tilted in the representation, with the 3rd dimension, drawn horizontally, representing
time. Nominally one symbol is transmitted per symbol period, so that within
each symbol period, one circle is used to represent the transmitted symbol. Thus in
the first symbol period, dibit 00 is transmitted and in the second period, dibit 11 is
transmitted. To improve the transmission rate, we can squeeze several symbols
into the same time period, for example, 4 of them as shown in the next part of Figure 1.35.
Here the dibits transmitted are 00, 11, 10, 01. This can have severe consequences,
though: for example, the equalization rate has to increase 4 times, as mentioned above.
To overcome this, we can space symbols in frequency, rather than in time.
As shown in the bottom part of Figure 1.35, each symbol is mapped into the frequency
domain, all within this same time period. With 4 symbols, there are 4 frequency
components: $f_0$, $f_1$, $f_2$, $f_3$, each with a different amplitude. This mapping is done using a
Fast Fourier Transform (FFT), and instead of sending 4 QPSK dibits in one time
period, we send the FFT of these 4 QPSK dibits in the same time period. Thus the data
rate increases without increasing the equalization rate.

The implementation is shown in Figure 1.36. In the transmitter the Inverse Fast
Fourier Transform (IFFT) is used for modulating the data constellation on the
frequency components. If $N$ is the number of FFT points in the IFFT, then $N$ data
constellation points are operated on (here $N = 4$, frequency components $f_0$ to $f_3$ for the
4 QPSK dibits). The IFFT generates these $N$ output samples in the time domain. They are
the baseband signals. They are then mapped onto the set of $N$ orthogonal subcarriers.
Here $N = 4 = 2^2$, a power of 2; in general $N$ is also typically a power of 2. The mapped
baseband signal is then converted to an analog signal by the digital to analog converter
(DAC). In-phase/quadrature (I/Q) modulation and up-conversion are then performed, and the
output is amplified by a power amplifier (PA) and transmitted on an RF carrier. At the
receiver, after amplification by a low noise amplifier (LNA), the signal is downconverted
and digitized by the analog to digital converter (ADC). The FFT is applied to demodulate the
OFDM signals (here the 4 frequency components $f_0$ to $f_3$). The equalization (symbol
demapping) required for detecting the data constellation is performed by multiplying
the FFT output by the inverse of the estimated channel transfer function.
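
The transmit and receive chain described above can be sketched at baseband for the four dibits 00, 11, 10, 01. This is a simplified illustration under several assumptions that are not in the text: a particular QPSK bit mapping, a naive DFT standing in for the FFT/IFFT, a one-sample cyclic prefix, a hypothetical two-tap channel that the receiver is assumed to know exactly, and no noise, DAC/ADC, or RF stages.

```python
import cmath

# Hypothetical QPSK mapping: one constellation point per quadrant.
QPSK = {"00": complex(1, 1), "01": complex(-1, 1), "11": complex(-1, -1), "10": complex(1, -1)}

def dft(values, inverse=False):
    """Naive N-point DFT (or inverse DFT); stands in for the FFT/IFFT."""
    n = len(values)
    sign = 1 if inverse else -1
    out = [
        sum(values[m] * cmath.exp(sign * 2j * cmath.pi * k * m / n) for m in range(n))
        for k in range(n)
    ]
    return [v / n for v in out] if inverse else out

def ofdm_transmit(dibits, prefix_len):
    """Place one QPSK point on each subcarrier, go to the time domain, prepend a cyclic prefix."""
    time_samples = dft([QPSK[d] for d in dibits], inverse=True)
    return time_samples[-prefix_len:] + time_samples

def apply_channel(samples, taps):
    """Linear convolution with the channel taps, truncated to the input length."""
    return [
        sum(taps[j] * samples[n - j] for j in range(len(taps)) if n - j >= 0)
        for n in range(len(samples))
    ]

def ofdm_receive(samples, prefix_len, taps, num_carriers):
    """Drop the prefix, take the DFT, divide by the channel response, pick the nearest point."""
    body = samples[prefix_len:prefix_len + num_carriers]
    spectrum = dft(body)
    channel_spectrum = dft(list(taps) + [0.0] * (num_carriers - len(taps)))
    equalized = [y / h for y, h in zip(spectrum, channel_spectrum)]
    return [min(QPSK, key=lambda d: abs(QPSK[d] - point)) for point in equalized]

sent = ["00", "11", "10", "01"]
channel_taps = [1.0, 0.5]  # direct path plus one delayed reflection (assumed)
transmitted = ofdm_transmit(sent, prefix_len=1)
received = apply_channel(transmitted, channel_taps)
decoded = ofdm_receive(received, prefix_len=1, taps=channel_taps, num_carriers=4)

print("sent:   ", sent)
print("decoded:", decoded)
```

The cyclic prefix makes the channel act as a circular convolution on each block, so equalization reduces to one complex division per subcarrier instead of a long time-domain filter running at the full symbol rate.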

With all the subcarriers present, it is not surprising that synchronization is
important. Both time and frequency synchronization are necessary. Frequency
synchronization is needed to align the modulator's and demodulator's local oscillator
frequencies, on all the subcarriers. Time synchronization is needed to identify the
start of the OFDM symbol. Given that the subcarriers need to be orthogonal,
synchronization is important since loss of synchronization results in loss of
orthogonality. Then overlapping of subcarriers is an issue, resulting in intersymbol
interference and interchannel interference.


1.10.2    Fast Fading

What happens if the channel is changing very fast? This time variation can come
from atmospheric variation, movement of mobile receivers, or movement of
reflectors. Let us look at (1.37) again. It turns out that one of the most prominent
effects comes from the changing of $f_c$ in (1.37) due to time variation. This would
have a pronounced effect on $x_R(t)$. To illustrate the effect due to a change in $f_c$, we
use an example that most readers will be familiar with: the Doppler shift associated
with the Doppler effect.

The  Doppler  effect  refers  to  the  change  in  frequency  that  occurs  at  a  point  of 
observation  as  a  moving  sound  source  passes  that  point.  A  familiar  example  may  be 
the  pitch  change  heard  in  a  train's  horn  as  it  passes  a  crossing.  Similar  phenomena 
affect  the  received  signals  in  wireless  communication  when  the  mobile  unit  is  in 
motion. Assume that a mobile phone user is sitting in a car in a parking lot, near a
Fig. 1.37  Illustration of possible signal reception for the mobile unit in motion


busy  highway.  Although  the  user  is  relatively  stationary,  part  of  the  environment  is 
moving  at  120  km  per  hour.  The  automobiles  on  the  highway  become  reflectors  of 
radio  signals.  If  the  mobile  phone  user  is  now  also  driving  on  the  highway,  then  the 
reflected  signals  vary  at  a  faster  rate  with  respect  to  the  receiver  (the  driver). 
The  rate  of  variations  of  the  signal  is  described  as  Doppler  spread.  Intuitively,  the 
multipath  fading  channel  varies  faster  when  the  mobile  unit  is  moving  at  a  higher 
speed.  This  situation  is  illustrated  in  Figure  1.37. 

Let the $i$th reflected wave arrive from an angle $\alpha_i$ relative to the direction of
motion of the Rx antenna. If the mobile unit is stationary, the carrier frequency of
the received signal can be expressed as $f_c = c/\lambda$. In the case where the
mobile unit is in motion, the relative speed of the radio wave with respect to the Rx
antenna changes. This results in a frequency change, and the modified frequency, $f_m$,
can be expressed as $f_m = (c \pm v)/\lambda = f_c \pm \Delta f$. Accordingly, the Doppler
shift in this situation is

$$\Delta f_i = \frac{v}{\lambda} \cos\alpha_i \qquad (1.62)$$

where $v$ is the speed of the mobile unit. Note that $v \cos\alpha_i$ is the projection of the
speed on the received signal path direction, since only this component causes the Doppler
shift. Accordingly, the maximum Doppler shift occurs for a wave traveling directly opposite
to the direction in which the antenna is moving, where $\cos\alpha_i = 1$. This maximum shift,
denoted as $f_d$, is given by:

$$f_d = \frac{v}{\lambda} = \frac{v \cdot f_c}{c} \qquad (1.63)$$
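As a quick numerical illustration of (1.62) and (1.63), the following Python sketch evaluates the Doppler shift of a single path for a few arrival angles. The function names and the example values (120 km/h, 1.9 GHz) are chosen here for illustration only and are not taken from any standard.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded)


def doppler_shift(speed_mps, carrier_hz, angle_rad):
    """Doppler shift of one path per (1.62): (v / lambda) * cos(alpha_i)."""
    wavelength = C / carrier_hz
    return (speed_mps / wavelength) * math.cos(angle_rad)


def max_doppler_shift(speed_mps, carrier_hz):
    """Maximum Doppler shift per (1.63): v / lambda = v * f_c / c."""
    return speed_mps * carrier_hz / C


v = 120 / 3.6   # 120 km/h expressed in m/s (illustrative value)
fc = 1.9e9      # illustrative carrier frequency in Hz

for angle_deg in (0, 60, 90, 180):
    shift = doppler_shift(v, fc, math.radians(angle_deg))
    print(f"alpha = {angle_deg:3d} deg -> shift = {shift:8.1f} Hz")
```

A wave arriving at 90 degrees to the direction of motion produces (numerically) no shift, while 0 and 180 degrees give plus and minus the maximum of about 211 Hz.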

This Doppler spread, $f_d$, has an interpretation similar to that of the time-delay spread,
$\tau_m$. Referring to Figure 1.38, if a carrier wave is transmitted at frequency $f_c$, after
transmission over a fast fading channel we receive a power spectrum that is spread over a
certain frequency range. Note that the shape of the distribution can be calculated
theoretically. It turns out that it has a specific shape under the assumption that the
phase introduced by the channel is uniformly distributed over the interval $[0, 2\pi]$.
The frequency range where the power spectrum is nonzero defines the Doppler spread.
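The text does not give the spectrum shape explicitly. One commonly cited result is the U-shaped spectrum of the Clarke/Jakes model, which additionally assumes many equal-power paths with uniformly distributed arrival angles. The sketch below evaluates that shape as a function of the offset from the carrier; that this is the shape intended in Figure 1.38 is an assumption, and the function name is illustrative.

```python
import math


def doppler_spectrum(f_offset_hz, fd_hz):
    """Clarke/Jakes Doppler power spectrum, normalized to unit total power.

    f_offset_hz is the frequency offset from the carrier f_c and fd_hz is the
    maximum Doppler shift. The theoretical density grows without bound as the
    offset approaches +/- fd_hz; this sketch simply returns 0 at and beyond
    the band edges.
    """
    if abs(f_offset_hz) >= fd_hz:
        return 0.0
    return 1.0 / (math.pi * fd_hz * math.sqrt(1.0 - (f_offset_hz / fd_hz) ** 2))


fd = 211.0  # illustrative maximum Doppler shift in Hz
for offset in (-200.0, -100.0, 0.0, 100.0, 200.0, 300.0):
    print(f"{offset:7.1f} Hz -> {doppler_spectrum(offset, fd):.3e}")
```

Under this model the received power is confined to offsets between $-f_d$ and $+f_d$ around the carrier, which matches the statement that the nonzero range of the spectrum defines the Doppler spread.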

So far we have shown that there is a carrier frequency change when the mobile
unit is in motion. How is a change in carrier frequency related to its phase? From a
phasor diagram interpretation, we observe that two sinusoidal signals with different
frequencies rotate at different speeds, generating a time-varying relative phase.
The beat frequency (frequency difference) determines the rate of change of this
relative phase. In the case where the Doppler effect causes a frequency change, the
beat frequency is simply the amount of Doppler frequency shift. It can be observed
that this motion of the Rx antenna leads to time-varying phase shifts of individual
reflected waves. Many waves arrive at the Rx antenna, all with different phase shifts,
and since their relative phases change with time, their effect on the amplitude and
phase of the resulting composite signal is also time varying. Therefore, the Doppler
effect determines the rate at which the amplitude and phase of the resulting composite
signal change. We next examine the impact of this effect.

Fig. 1.38 Power profile of a Doppler shifted channel in the frequency domain (power spectrum versus frequency)


1.10.2.1     Impact  of  the  Doppler  Effect 

Baseband  Signal  Amplitude  Degradation 

As mentioned in the previous subsection, in order to detect the baseband data
sequence successfully at the receiver we have to preserve the envelope of the
transmitted signal, where in the ideal case the envelope would remain constant
during the transmission. To model the reception in a time-varying multipath
channel we start from (1.36) again, but note that $\tau_i$ is time varying in the
present case:

$$x_R(t) = \Re\left\{ \left[ \sum_i A_i(t)\, e^{j\phi_i(t)} \right] e^{j 2\pi f_c t} \right\} \qquad (1.64)$$

Here $A_i(t)$ summarizes the attenuated, delayed baseband signal and $\phi_i(t)$ is the
time-varying phase of the $i$th path.

The phases vary rapidly with time because even very minor receiver movements
are large with respect to the wavelength of the propagating wave. As a start, we assume
that the mobile unit is stationary; then the phase shift introduced by the channel for
the $i$th path can be expressed as

$$\phi_i = 2\pi f_c \tau_i = 2\pi \frac{c}{\lambda} \cdot \frac{s_i}{c} = 2\pi \frac{s_i}{\lambda} \qquad (1.65)$$

where $\lambda$ is the wavelength of the carrier signal and $s_i$ is the additional distance
traveled by the reflected wave of the $i$th path with respect to the direct path. In the
case where the mobile unit is in motion, the distance $s_i$ varies with time (linearly with
respect to the speed of the receiver). Then the time-varying phase (for the $i$th path)
can be decomposed into a time-invariant part and a time-varying part as

$$\phi_i(t) = \phi_i + 2\pi \frac{c}{\lambda} \cdot \frac{\Delta s_i(t)}{c} \qquad (1.66)$$

Here $\Delta s_i(t)$ is the time-varying part of $s_i(t)$.

To simplify, we assume that $\Delta s_i(t)$ is given by its maximum value, which is $v \cdot t$.
Then $\phi_i(t)$ is given by

$$\phi_i(t) = \phi_i + 2\pi \frac{v t}{\lambda} \qquad (1.67)$$

Since $v/\lambda$ represents the maximum possible Doppler frequency shift as derived
in (1.63), we can model the time-varying phase as follows:

$$\phi_i(t) = (2\pi f_d)\, t + \phi_i \qquad (1.68)$$

where $\phi_i$ are random phases uniformly distributed from 0 to $2\pi$ and $f_d$ is the
maximum Doppler frequency shift due to the motion of the mobile unit. Substituting
(1.68) in (1.64) and writing the complex exponentials in terms of their real and
imaginary components, we get an expression of $x_R(t)$ in terms of the in-phase
and quadrature components:

$$x_R(t) = s_I(t) \cos(2\pi f_c t) - s_Q(t) \sin(2\pi f_c t) \qquad (1.69)$$

where

$$s_I(t) = \sum_i A_i(t) \cos(2\pi f_d t + \phi_i) \qquad (1.70)$$

$$s_Q(t) = \sum_i A_i(t) \sin(2\pi f_d t + \phi_i) \qquad (1.71)$$

The envelope of the received signal $x_R(t)$ is given by

$$x_R(t)_{\text{envelope}} = \sqrt{s_I(t)^2 + s_Q(t)^2} \qquad (1.72)$$

Fig. 1.39 (a) The mobile unit moves at a speed of $v$ with respect to the Tx antenna, causing
time-varying phase changes. (b) The continuous lines represent the phasors of the two signals
received at time $t_0$ (position 1) and the dashed lines represent the phasors of the two signals
received at time $t_0 + \Delta t$ (position 2)


Notice that (1.70) through (1.72) are similar to (1.42) through (1.44), except that
the $s_I$ and $s_Q$ expressions are time varying. Hence the envelope would also have a
Rayleigh distribution, except that the distribution becomes time varying in the present
case. From (1.70) through (1.72) we observe that $f_d$ determines the rate at which
deep fading of the envelope occurs.
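To get a feel for how the envelope of (1.72) fluctuates, the following Python sketch sums a number of paths and samples the envelope over 100 ms. It departs from (1.70) and (1.71) in two labeled ways: the amplitudes $A_i$ are held constant, and each path is given its own shift $f_d \cos\alpha_i$ from (1.62) with a random arrival angle, so that the relative phases between paths change with time. The path count, seed, and sampling step are arbitrary illustrative choices.

```python
import math
import random


def envelope(t, amps, phases, shifts_hz):
    """Envelope sqrt(s_I^2 + s_Q^2) as in (1.72), evaluated at time t in seconds."""
    s_i = sum(a * math.cos(2 * math.pi * f * t + p)
              for a, p, f in zip(amps, phases, shifts_hz))
    s_q = sum(a * math.sin(2 * math.pi * f * t + p)
              for a, p, f in zip(amps, phases, shifts_hz))
    return math.hypot(s_i, s_q)


def make_paths(n_paths, fd_hz, seed=0):
    """Random paths: equal constant amplitudes, uniform phases and arrival angles."""
    rng = random.Random(seed)
    amps = [1.0 / math.sqrt(n_paths)] * n_paths
    phases = [rng.uniform(0.0, 2 * math.pi) for _ in range(n_paths)]
    # Assumption: per-path shift f_d * cos(alpha_i), following (1.62).
    shifts = [fd_hz * math.cos(rng.uniform(0.0, 2 * math.pi)) for _ in range(n_paths)]
    return amps, phases, shifts


amps, phases, shifts = make_paths(n_paths=20, fd_hz=211.0)
samples = [envelope(k * 1e-4, amps, phases, shifts) for k in range(1000)]  # 100 ms
print(f"min envelope = {min(samples):.3f}, max envelope = {max(samples):.3f}")
```

Increasing `fd_hz` in this sketch makes the envelope dips occur more often per unit time, in line with the observation that $f_d$ sets the rate of deep fading.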

One possible way to quantify time-varying envelope fading is by the use of a
time-varying phasor diagram. An example is illustrated in Figure 1.39(b).
The channel itself is shown in Figure 1.39(a), where we have assumed that there
are two paths in the channel.

Referring to Figure 1.39, you may think of this as taking a picture of the
time-varying phasor diagram after each interval $\Delta t$. Due to the time-varying
relative phase shifts, the two paths may interfere in a destructive or in a constructive
fashion. Obviously, the time interval of interest is the signaling interval $T_b$, since
during that time we want to preserve the amplitude of the envelope. We observe from
(1.70) through (1.72) that the envelope's amplitude changes (goes through a deep fade)
periodically at a frequency $f_d$. Thus, to quantify the time-varying nature of the
channel, we define the coherence time $(\Delta t)_c$, which is related to the maximum
Doppler frequency shift as

$$(\Delta t)_c \approx \frac{1}{f_d} \qquad (1.73)$$


Substituting (1.63) into (1.73) also allows us to establish a relation between the
coherence time $(\Delta t)_c$ and the speed $v$ of the mobile unit. What does the coherence
time of a multipath channel tell us now? In the case where the mobile unit is
stationary ($v$ is zero), the coherence time $(\Delta t)_c$ is infinitely long, because there is
no time-varying phase change. In the case where the mobile unit is moving during
transmission, there are two scenarios. In the first scenario the signaling interval
$T_b$ is much smaller than the coherence time $(\Delta t)_c$ of the channel. Here the channel
attenuation and phase shift are essentially fixed for the duration of at least
one signaling interval. We refer to this as slow fading. The important relation to
ensure slow fading is

$$T_b \ll (\Delta t)_c \qquad (1.74)$$

In the second scenario we have $T_b \gg (\Delta t)_c$. Here the received signal is severely
distorted, since the rate of change in the envelope's amplitude is rather large.
The channel is said to be fast fading.

Fig. 1.40 BER as a function of SNR, with Doppler effect, for GMSK modulation


Random  Phase  and  Frequency  Shift 

Because of the Doppler shift, both the frequency and the phase of the received signal
are shifted. In addition, since the Doppler shift happens in a random manner, this phase
shift is also random in nature. In the case where we encode the information in phase (as
in MSK), such a change in phase leads to a degradation (increase) in BER. Up to a certain
point, this degradation in BER can be recovered by increasing SNR. However, if the fading
becomes too fast, increasing SNR does not help and the BER remains fixed at an
irreducible level [7], [8], and [9], as shown in Figure 1.40. This is to be expected
because the perturbation caused by fast fading is multiplicative in nature and hence
insensitive to signal amplitude.

Numerical  Example  1.5 

A car drives with $v = 120$ km/h on the highway. The driver uses a DECT mobile
phone. Then the maximum Doppler frequency shift is given by (1.63):

$$f_d = \frac{120\ \text{km/h} \cdot 1900\ \text{MHz}}{3 \cdot 10^8\ \text{m/s}} = 211\ \text{Hz} \qquad (1.75)$$


Since the Doppler frequency shift can go both ways depending on the direction of
the reflected energy, the bandwidth of the received signal is approximately 106 Hz,
and from (1.73) the coherence time $(\Delta t)_c \approx 9.4$ ms. For DECT, $T_b \approx 0.58$ μs. From
(1.74) we conclude that the channel is slowly fading. Hence the envelope's amplitude
goes through a deep fade rather infrequently.

We will also check the impact on BER due to phase/frequency shift for GMSK
modulation, by using Figure 1.40. For velocity up to 150 km/h, if SNR is larger than 25
dB, then BER is less than $10^{-3}$ and hence the BER requirement of DECT is satisfied.
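The numbers in Numerical Example 1.5 can be rechecked with a short Python sketch. It assumes the convention $(\Delta t)_c \approx 1/f_d$ applied directly to the maximum Doppler shift, so its coherence time differs by a constant factor from the 9.4 ms quoted above, which is based on a bandwidth of 106 Hz. Either way the signaling interval is smaller by more than three orders of magnitude. The factor of 10 used as the "much smaller" test is a hypothetical margin.

```python
C = 3.0e8  # speed of light in m/s, rounded as in (1.75)


def max_doppler_shift(speed_kmh, carrier_hz):
    """Maximum Doppler shift (1.63), with the speed given in km/h."""
    return (speed_kmh / 3.6) * carrier_hz / C


def coherence_time(doppler_hz):
    """Coherence time under the assumed convention (delta t)_c ~ 1 / f_d."""
    return 1.0 / doppler_hz


fd = max_doppler_shift(120, 1900e6)   # about 211 Hz
tc = coherence_time(fd)               # a few milliseconds
tb = 0.58e-6                          # DECT signaling interval quoted in the text

print(f"f_d = {fd:.0f} Hz, coherence time = {tc * 1e3:.1f} ms")
print(f"coherence time / T_b = {tc / tb:.0f}")
# Hypothetical margin: call the channel slow fading if T_b is at least 10x smaller.
print("slow fading" if tb < tc / 10 else "not clearly slow fading")
```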


1.10.2.2     Time  Diversity 

In Figure 1.40 we see that as speed increases, BER increases. To compensate for that
we have to increase the SNR, which can be costly. A possible technique to improve the
received signal BER without increasing SNR is called time diversity. In time diversity
the same information-bearing signal is transmitted in $n$ different time slots, where
the separation between different time slots exceeds the coherence time $(\Delta t)_c$ of the
channel. Consequently, the signals are affected differently by the fading channel, and
it is unlikely that all $n$ copies experience a deep fade at once.
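The benefit of time diversity can be made concrete with a small probability sketch in Python. It assumes a Rayleigh-faded envelope (so the instantaneous power is exponentially distributed), fully independent fading between slots because they are separated by more than the coherence time, and a receiver that can make use of whichever copy is not faded. The 10 dB fade depth is an arbitrary illustrative threshold.

```python
import math


def deep_fade_probability(depth_db):
    """P(power is more than depth_db below its mean) for Rayleigh fading.

    For a Rayleigh envelope the instantaneous power is exponentially
    distributed, so P(power < x * mean) = 1 - exp(-x).
    """
    x = 10 ** (-depth_db / 10)
    return 1.0 - math.exp(-x)


def all_copies_faded(depth_db, n_slots):
    """Probability that n independently faded copies are all in a deep fade."""
    return deep_fade_probability(depth_db) ** n_slots


for n in (1, 2, 3):
    print(f"n = {n}: P(all copies faded by 10 dB) = {all_copies_faded(10.0, n):.2e}")
```

Each additional independent slot multiplies the probability of losing the symbol to a deep fade by the single-slot fade probability, at the cost of transmitting the same information $n$ times.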


1.10.3     Comparisons of Flat/Frequency Selective and Slow/Fast Fading

Figure 1.41 [4] shows a comparison of flat/frequency selective and slow/fast fading, as
well as the criteria for comparison. The most benign environment is, of course, the slow,
flat fading channel. From Figure 1.41, this is achieved by picking a signaling interval
$T_b$ such that $T_b \gg \tau_m$ (to achieve flat fading) and $T_b \ll (\Delta t)_c$ (to achieve slow fading).
Other channel environments can be achieved by selecting $T_b$ accordingly.
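The two criteria can be combined into a small classification helper, sketched below in Python. The `margin` parameter is a hypothetical stand-in for "much greater than", and the 100 ns delay spread in the first call is an illustrative value rather than a figure from the DECT standard. The sketch also simplifies by assigning any case that fails a criterion to the opposite category.

```python
def classify_fading(tb, delay_spread, coherence_time, margin=10.0):
    """Classify a channel from the signaling interval T_b (all times in seconds).

    margin is a hypothetical factor standing in for "much greater than".
    """
    flat = tb >= margin * delay_spread       # T_b much larger than tau_m
    slow = tb * margin <= coherence_time     # T_b much smaller than (delta t)_c
    return ("flat" if flat else "frequency selective",
            "slow" if slow else "fast")


# DECT-like numbers: T_b and coherence time from the text, delay spread assumed.
print(classify_fading(tb=0.58e-6, delay_spread=100e-9, coherence_time=9.4e-3))
# A long signaling interval on a rapidly varying channel.
print(classify_fading(tb=1e-3, delay_spread=1e-6, coherence_time=1e-4))
```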


1.11     Summary  of  Standard  Translation 

We  have  applied  communication  concepts,  such  as  modulation/pulse  shaping  and 
fading,  to  the  modulator/demodulator  block  and  channel  block,  respectively. 
This  allows  us  to  translate  standards'  requirements  to  the  boundary  conditions 
imposed  on  the  receiver  front  end.  These  boundary  conditions  are  functions  of  the 
wireless  channel  environment,  modulation/demodulation  scheme,  and  bit  error  rate 
of  the  standard.  When  we  apply  this  translation  to  a  specific  standard,  the  DECT 
standard,  we  have  the  following: 

1.  Due to path loss, the receiver can expect to have a received signal, from the
channel, whose minimum carrier's amplitude, Pmin, is -77 dBm. Hence the front
end's Pmin is -77 dBm.

2.  Due to envelope's fading and AWGN in the channel, the SNR at the input of the
demodulator, denoted as SNRdemod_in, is required to be 25 dB. We assume that
no diversity technique is applied and hence the SNR at the output of the front
end, denoted as SNRrec_front_out, is required to be 25 dB.

3.  Due to frequency selective fading, BER is larger than the required $10^{-3}$ level.
We assume that equalization is applied in the demodulator block to restore the
BER to the $10^{-3}$ level. Hence no extra requirement is put on the front end.
The channel is slow fading; hence the envelope's amplitude goes through a deep fade
rather infrequently. BER for mobile velocity up to 150 km/h remains below $10^{-3}$
and no time diversity is needed. Hence no extra requirement is put on the front end.


1.12     Problems

1.1  To illustrate where some of the numbers given in a standard come from,
consider the DECT standard again. Derive, using proper approximation and
other information given in the DECT standard, that the channel bandwidth is
1.728 MHz.

1.2  For  the  MSK  scheme  as  shown  in  Figure  P.l,  assume  that  the  signal  at  nodes 
A  and  B  is  as  given.  Show  the  corresponding  waveform  at  nodes  C,  D  and 
Smsk(o-  Assume  that  the  time  between  the  dotted  lines  is  Wr1?  where  T;  is  the 
time  when  the  signal  at  node  A  is  a  1 . 

1.3  Would the $P_e$ versus $E_b/N_0$ curve be identical for BPSK and QPSK? Explain.

1.4  In this chapter we showed plots of the $P_e$ of different modulation schemes
using SNR as an independent variable ($P_e$ vs SNR plot). On the other hand,
communication texts usually show plots of $P_e$ of different modulation schemes
using $E_b/N_0$ as an independent variable ($P_e$ vs $E_b/N_0$ plot). Comment on why, as
an independent variable, SNR may be a better choice for circuit designers
whereas $E_b/N_0$ may be a better choice for communication system engineers.

1.5  Explain,  with  the  help  of  time  domain  graphical  representations,  using  a  sine 
pulse  superimposed  on  a  carrier,  how  multipath  fading  causes  fluctuation  in  an 
envelope's  amplitude  as  well  as  introduces  distortion  in  the  envelope. 

1.6  For the NLOS path loss calculation, show that, using a very crude approximation
and involving only two paths (one direct, one reflected), the exponent n
in (1.24) is 4.

1.7  In Figure 1.32, we stated that as delay increases, attenuation increases.
Can you justify why this is true?

1.8  In (1.59), $x_R(t) = \sum_i a_i s_I(t - \tau_i)$, and we say that this describes a filter
similar to an FIR filter. Comment on the validity of this statement.

1.9  We noted that the concept of coherence bandwidth is important to wireless
communication  system  designers.  Comment  on  its  relevance  to  wireless  circuit 
designers.  How  do  they  incorporate  this  in  their  design  process? 

1.10  We mentioned that BER due to fast fading and frequency selective fading
reaches an irreducible level for a large enough SNR. Offer a qualitative
explanation.

Chapter 2

Receiver  Architectures 


2.1     Introduction 

In  Chapter  1  we  focused  on  the  blocks  enclosed  by  dotted  lines  in  Figure  1.1. 
They  are  the  modulator,  demodulator,  and  channel  part  of  the  communication 
system.  We  now  redraw  Figure  1.1  as  Figure  2.1,  with  the  receiver  front  end 
enclosed  by  dotted  lines.  In  this  chapter  and  the  rest  of  the  book,  we  emphasize 
this  front  end  part  of  the  communication  system.  Specifically,  in  this  chapter  we 
discuss  the  architecture  of  the  front  end  as  well  as  the  filters  inside,  and  in  the  rest  of 
the  book  we  concentrate  on  the  design  of  active  components  inside  the  front  end. 
We first review some general philosophy on deciding the front end architecture.
We then adopt a specific receiver architecture and translate the boundary conditions
imposed by the channel and the demodulator on the front end (specified in Chapter 1
in terms of communication concepts [SNR]) to circuit concepts (gain, noise figure,
distortion). Next we translate these boundary conditions into boundary conditions of
the front end sub-blocks. Examples of these sub-blocks include low-noise amplifiers
(LNA), mixers, and intermediate frequency (IF) amplifiers. In subsequent chapters,
these boundary conditions will be translated into the design specifications of the
various subcomponents.

2.2     Receiver Front End: General Discussion

2.2.1     Motivations

The  first  question  we  must  ask  is,  why  do  we  need  a  front  end?  To  answer  it,  we 
repeat  the  boundary  conditions  imposed  by  the  channel  and  modulator/demodulator 
for  the  front  end,  derived  in  Chapter  1  for  the  DECT  example: 

1.  Due  to  path  loss,  the  receiver  can  expect  to  have  a  received  signal,  from  the 
channel,  whose  minimum  carrier's  amplitude,  Pmin,  is  -77  dBm.  Hence  the  front 
end's  Pmin  is  -77  dBm. 

2.  Due  to  envelope's  fading  and  AWGN  in  the  channel,  the  SNR  at  the  input  of  the 
demodulator,  denoted  as  SNRdemod_in,  is  required  to  be  25  dB.  We  assume  that 
no  diversity  technique  is  applied  and  hence  the  SNR  at  the  output  of  the  front 
end,  denoted  as  SNRrec_front_out,  is  required  to  be  25  dB. 

Even though these boundary conditions have only been determined for DECT,
similar situations exist for other standards as well. The difference usually lies in the
numbers. For example, in GSM, the Pmin number will be much worse due to a larger
cell size. To justify the need for a front end, let us go back to the DECT standard.
We want to see whether, under the worst input condition as described in boundary
condition (1), boundary condition (2) would still be satisfied. In other words,
when the antenna has a received power of -77 dBm and there is only AWGN in a
fading channel, would the resulting SNRdemod_in meet the required SNRdemod_in?
Let us assume that the AWGN comes just from the thermal noise of the antenna
termination resistance of 50 Ω. As shown in Problem 2.1, the SNRdemod_in is a mere
5 dB. Therefore, we definitely do not meet the required SNRdemod_in of 25 dB. This
conclusion is true in general for other wireless standards, and hence in general a
front end is required.

To  make  the  no-front-end  option  look  even  worse,  let  us  note  that  in  boundary 
condition  (2)  the  noise  source  in  the  SNR  calculation  has  been  assumed  to  be  from 
AWGN  in  the  channel  only.  We  now  state  another  possible  noise  source  in  the 
channel:  interference.  Such  interference  in  the  channel  comes  from  the  following: 

1.  Adjacent channel interference

2.  Co-channel interference

3.  Interference from transmit bands of other wireless standards, and from the transmit
band of the same channel should frequency division duplexing (FDD) be used

Because this interference is random in nature, the demodulator cannot
differentiate it from AWGN and will process it just like noise during the
demodulation operation. With this additional noise source, SNRdemod_in is reduced
even more, further justifying the need for a front end.

2.2.2     General Design Philosophy

The technique to combat a low SNRdemod_in is to add a front end block, which
processes (conditions) the received signal/AWGN/interference before admitting it
to the demodulator. This processing can be done in several ways:

1.  Reduce interference/AWGN. This can be done by putting a filter in the front end
block, which filters out the interference and AWGN. Filtering for the purpose of
reducing interference to an extent that the resulting SNRdemod_in meets the
required SNRdemod_in is denoted as channel filtering. At what frequency do we
want to do channel filtering? Remember, channel filtering at RF involves
performing rather narrowband filtering at RF. Doing narrowband
filtering at RF, though conceptually simple, is impractical since high-frequency
narrowband filtering is expensive.

2.  Amplify the desired signal at RF. This can be done by putting an amplifier inside
the front end block. Under this arrangement the antenna feeds its
output into an amplifier, whose output in turn drives the demodulator. Unfortunately,
the desired signal, in the worst case, is so small that a large amplification
factor is necessary to boost the SNRdemod_in to the required level. But remember,
when we amplify the desired signal, we are also amplifying the interference and
AWGN. In particular, the interference's power, being rather large, when applied
to an amplifier with a large gain will have the following undesirable effects:

a)  It will saturate the amplifier.

b)  The amplifier, being nonlinear, will generate intermodulation products from
these interferers. Some of these products can have the same frequency as
the desired signal and corrupt it.

3.  Reapply (2), but this time filter out the interference first. The filtered
interference has such a small amplitude that when fed to an amplifier, it will not
cause the undesirable effects outlined in (2a) and (2b). This type of filtering is
denoted as interference suppression filtering, to be distinguished from channel
filtering. The interference suppression filtering requirement is not as stringent as
that of channel filtering because all we need to do is to filter out enough interference
so that there are no undesirable effects. Now suppose we do this and repeat (2).
Can we amplify the desired signal at RF all the way up so as to achieve the
required SNRdemod_in? It turns out that the required amplification for the desired
signal at RF is so large that the accompanying interference suppression filtering
needs to be very stringent. Hence it is not practical to do this interference
suppression filtering at RF.

4.  Having gone through a few alternatives, it seems that the following compromise
solution holds the best promise:

a)  Do partial amplification at RF so that it is practical to implement the required
interference suppression filtering at RF.

b)  Frequency translate the partially amplified signal down to a lower
frequency, called the intermediate frequency (IF). We can now perform
some more amplification since it is more practical to implement the required
interference suppression filtering at a lower frequency such as IF. At this
lower frequency we can also check whether the frequency is low enough
for channel filtering to become practical. As a result of the combination
of this partial amplification and channel filtering the SNRdemod_in is
improved. This improved SNRdemod_in is then checked to see if it meets
the required SNRdemod_in. If it does not, we repeat the process until the
required SNRdemod_in is met. Then we feed the final amplified and filtered
signal to our demodulator. Of course, in the process of frequency translation
(unless the frequency translation is down to baseband or DC), interference
that happens to be present at the image frequency (to be defined later) would
also be translated to the desired frequency and has to be filtered out before the
mixing. This is called anti-image filtering.

In  a  sense  the  process  outlined  in  (4)  reverses  the  attenuation  the  channel 
imposes  on  the  desired  signal  and  suppresses  the  strong  interference  and  noise 
admitted  via  the  channel,  in  a  couple  of  steps.  There  are  numerous  variations  of  this 
approach  but  they  all  follow  a  similar  philosophy.  To  illustrate  this  we  consider  one 
such  architecture,  called  the  heterodyne  architecture. 
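The intermodulation problem noted in item (2b) can be illustrated with a few lines of Python. For two interfering tones at $f_1$ and $f_2$, a third-order nonlinearity produces components at $2f_1 - f_2$ and $2f_2 - f_1$. The sketch places two interferers one and two channel spacings above a desired carrier; the desired carrier frequency is a hypothetical value, and the 1.728 MHz spacing is the DECT figure quoted in this book.

```python
def third_order_products(f1_hz, f2_hz):
    """Return the two third-order intermodulation frequencies of two tones."""
    return 2 * f1_hz - f2_hz, 2 * f2_hz - f1_hz


CHANNEL_SPACING_HZ = 1_728_000        # DECT channel spacing
desired_hz = 1_890_000_000            # hypothetical desired carrier

# Interferers in the adjacent and the alternate channel above the desired one.
f1 = desired_hz + CHANNEL_SPACING_HZ
f2 = desired_hz + 2 * CHANNEL_SPACING_HZ

low, high = third_order_products(f1, f2)
print(low == desired_hz)    # the lower product lands on the desired channel
print(high - desired_hz)    # the upper product is three channel spacings away
```

Because the lower product falls exactly on the desired channel, no amount of later channel filtering can remove it, which is why the interferers have to be attenuated before they reach a high-gain nonlinear stage.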


2.2.3    Heterodyne  and  Other  Architectures 

A receiver front end, as we have seen, needs to achieve different objectives:
amplification, mixing, filtering, and demodulation. Demodulation may include
diversity and equalization. For DECT, given that the speed of the mobiles is
typically low, as we assumed in Chapter 1 (Section 1.8), the resulting fading
is slow and does not induce unacceptable BER. Thus diversity (to combat fast
fading) is not needed and this simplifies the demodulator.

The resulting heterodyne architecture is shown in Figure 2.2, where we denote
all the circuitry enclosed by the dotted lines as the front end. Notice that we have,
rather arbitrarily, lumped the antenna as part of the channel. We can observe that
the front end includes three bandpass filters (BPF). One design can be such that the
bandpass filter right after the antenna (BPF1) is used primarily to select the band of
interest of the received signal and is referred to as the band selection filter.
This usually provides enough suppression of out-of-band interference that the
undesirable effects described in (2a) and (2b) of subsection 2.2.2 are avoided. Note
that the band includes the entire spectrum in which the users of a particular standard
are allowed to communicate. For the DECT standard, the receive band spans from 1880
to 1900 MHz. Hence BPF1 has a bandwidth of around 20 MHz. As discussed
previously, the desired signal, immersed in noise (from both AWGN and interference),
can be really small at the Rx antenna (on the order of microvolts) and should be
amplified. This amplification should be done with minimal additional noise injected
by the amplifier itself, and so a low-noise amplifier (LNA) is used. As mentioned
previously, the amplification is followed by mixing, and the mixer should be preceded
by an anti-image filter. This is BPF2. This heterodyne architecture adopts one
single frequency translation (conversion) and hence is implemented by using a mixer
with a variable local oscillator (LO) frequency and a fixed IF. Using this approach,
the desired channel can be selected by choosing the proper LO frequency.

Fig. 2.2 RF receiver using a heterodyne architecture (signal chain inside the front end: BPF1, LNA, BPF2, mixer driven by the LO, BPF3, IF amplifier; the antenna precedes the front end and the demodulator follows it)
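Channel selection by LO tuning, and the image frequency that BPF2 has to reject, can be sketched numerically in Python. The image is the frequency on the opposite side of the LO, at the same distance as the desired signal, so that it mixes down to the same IF. The 110 MHz IF and the 1890 MHz carrier used below are hypothetical values chosen for illustration, and the function name is not from the book.

```python
def lo_and_image(f_rf_hz, f_if_hz, low_side=True):
    """Return (LO frequency, image frequency) for a desired RF channel.

    With low-side injection the LO sits below the desired channel and the
    image lies a further f_if below the LO; high-side injection mirrors this.
    """
    if low_side:
        f_lo = f_rf_hz - f_if_hz
        f_image = f_lo - f_if_hz
    else:
        f_lo = f_rf_hz + f_if_hz
        f_image = f_lo + f_if_hz
    return f_lo, f_image


F_IF_HZ = 110_000_000          # hypothetical fixed IF
channel_hz = 1_890_000_000     # hypothetical desired carrier in the 1880-1900 MHz band

f_lo, f_image = lo_and_image(channel_hz, F_IF_HZ)
print(f"LO = {f_lo / 1e6:.0f} MHz, image = {f_image / 1e6:.0f} MHz")
```

With these assumed numbers the LO is tuned to 1780 MHz and the image lies at 1670 MHz, twice the IF away from the desired channel. A higher IF therefore moves the image further from the band and eases the anti-image filter, at the cost of a harder channel filter at IF.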

Another  way  to  perform  channel  selection  in  this  architecture  consists  of  using  a 
mixer  that  mixes  down  with  a  fixed  LO  frequency.  The  mixer  is  then  followed  by  a 
tunable  bandpass  filter. 

In the present approach (variable LO frequency) the mixer is followed by a fixed
bandpass filter, BPF3. Since BPF3 operates at IF, we assume the quality factor (Q,
not to be confused with Q for the complementary error function) required of this filter
is now relaxed. Hence we assume that channel filtering becomes possible and therefore
BPF3 is also referred to as the channel filter. To perform channel filtering, the
bandwidth of this filter depends on the channel spacing accorded to each user,
which is unique for a particular standard. It is one of the most important constraints
on the front end design since this bandwidth is usually very narrow (e.g., 200 kHz in
GSM or 1.728 MHz in DECT). The channel filter is followed by an IF amplifier to
improve further the SNRdemod_in of the signal. The demodulator then takes this
output and performs demodulation. For the DECT standard, demodulation can be
carried out in a manner similar to Figure 1.7 of Chapter 1, which is a form of
coherent demodulation. This demodulation can be done in the analog or digital domain.
In the analog approach, all the correlations and multiplications are performed in the
analog domain. In the digital approach, analog-to-digital conversion is performed
and all the signal processing (correlations, multiplications) is performed using
a digital signal processor. Not shown in the diagram is the frequency synthesizer,
which generates the LO signal.

Note that there are many other types of possible receiver architectures.
For example, in [1] a wideband IF double-conversion scheme has been reported.
It uses a principle similar to that of the heterodyne architecture, but it uses two
frequency translation (conversion) blocks, with the first block using a fixed LO
frequency and the second block using a variable LO frequency. Further discussion of
this approach is carried out in Problem 2.2. Problem 2.9 goes through one design of
such an architecture. Yet another architecture uses only one conversion stage, where
the RF signal is down converted to baseband directly, and is called the homodyne or
direct conversion architecture.

Let  us  assume  that  we  are  going  to  use  the  heterodyne  architecture.  We  now 
derive  the  relevant  specifications  of  the  overall  receiver  front  end  and  then  the 
specifications  of  the  subcomponents.  Basically,  the  interference  requirement  allows 
us  to  determine  the  bandwidth  requirements  of  the  passive  subcomponents:  filters. 
The  sensitivity  requirement  will  then  allow  us  to  determine  specifications  on  the 
conversion  gain,  noise,  and  distortion,  first  on  the  front  end  and  then  on  individual 
active  subcomponents. 

In the following section we discuss filter design in some detail. The rest of this
chapter  and  subsequent  chapters  concern  the  design  of  the  active  subcomponents. 


2.3     Filter  Design 

One  major  technique  to  combat  interference  is  to  filter  it  out  with  bandpass  filters. 
For  most  bandpass  filters  the  relevant  design  parameters  consist  of  the  center 
frequency,  the  bandwidth  (which  together  with  center  frequency  defines  the  quality 
factor  Q),  and  the  out-of-band  suppression.  We  now  discuss  how  to  derive  the 
specifications  of  these  parameters.  Because  these  filters  are  normally  not  in  integrated 
circuit  form,  their  implementations  are  mentioned  only  briefly. 


2.3.1     Band  Selection  Filter  (BPF1) 

The bandwidth of the band selection filter is typically around the width of the band of interest
(for DECT, 17.28 MHz), and the center frequency is the center of the band (for
DECT, 1.89 GHz). The Q required is typically high (for DECT, 1.89 GHz/17.28
MHz ≈ 109) and the center frequency is high as well. On the other hand, the
suppression is typically not prohibitive. It only
needs to be large enough to ensure that interference is suppressed to the point that it
does not cause the undesirable effects discussed in sub-section 2.2.2. To satisfy
these specifications, BPF1 can be implemented using a passive LC filter. This LC
filter can be combined with the input matching network of the LNA, which is
described in Chapter 3.

Fig. 2.3    Image rejection problem in the case where FDMA techniques are used to separate
different users


2.3.2    Image  Rejection  Filter  (BPF2) 


The problem of the image due to mixing was mentioned in sub-section 2.2.2 and is
now explained in more detail. As shown in Figure 2.3, during the downconversion
process, the desired signal at $\omega_{rf}$, the radio frequency, and its image at $\omega_{image}$, the
image frequency, are downconverted to the same frequency $\omega_{if}$, the intermediate
frequency. Hence the desired signal is corrupted. For the downconverted image and
downconverted desired signal to overlap, $\omega_{image}$ is spaced $2\omega_{if}$ apart from $\omega_{rf}$:

$$\omega_{image} = \omega_{rf} - 2\omega_{if} \qquad (2.1)$$

where $\omega_{if}$ is, by definition, given by

$$\omega_{if} = \omega_{rf} - \omega_{lo} \qquad (2.2)$$

Here $\omega_{lo}$ is the local oscillator frequency.

To show the image problem mathematically, we model the downconversion
(mixing) process as mathematically equivalent to multiplication (a more complete
mathematical treatment of mixing is carried out in Chapter 4). Hence we can
write the IF signal as

$$V_{if}(t) = A_{rf}\cos\omega_{rf}t \times A_{lo}\cos\omega_{lo}t \qquad (2.3)$$

where $V_{rf} = A_{rf}\cos\omega_{rf}t$ denotes the desired signal, whose frequency is $\omega_{rf}$, and
$V_{lo} = A_{lo}\cos\omega_{lo}t$ is the local oscillator signal. For simplicity we assume that
$A_{lo} = A_{rf} = A$ and hence we have

$$V_{if}(t) = A\cos\omega_{rf}t \times A\cos\omega_{lo}t \qquad (2.4)$$

Next we apply trigonometric manipulations and rewrite (2.4) as follows:

$$V_{if}(t) = \frac{1}{2}A^2\left(\cos(\omega_{rf}+\omega_{lo})t + \cos(\omega_{rf}-\omega_{lo})t\right) = \frac{1}{2}A^2\left(\cos(\omega_{rf}+\omega_{lo})t + \cos\omega_{if}t\right) \qquad (2.5)$$

The second term is our term of interest. What happens if the received signal
consists of the desired signal accompanied by a strong interferer at the image
frequency $\omega_{image}$? Assume that the image is given by
$V_{image} = A_{image}\cos(\omega_{image}t) = A\cos(\omega_{image}t)$, where we have again assumed for simplicity that $A_{image} = A$. Then
upon downconversion (mixing) the interferer generates the downconverted image:

$$A\cos\omega_{image}t \times A\cos\omega_{lo}t \qquad (2.6)$$

Next we eliminate the term $\omega_{rf}$ between (2.1) and (2.2) and express $\omega_{image}$ as

$$\omega_{image} = \omega_{lo} - \omega_{if} \qquad (2.7)$$

Substituting this in (2.6) and applying trigonometric manipulations again, the
downconverted image becomes

$$\frac{1}{2}A^2\left(\cos(\omega_{lo}-\omega_{if}+\omega_{lo})t + \cos(\omega_{lo}-\omega_{if}-\omega_{lo})t\right) = \frac{1}{2}A^2\left(\cos(2\omega_{lo}-\omega_{if})t + \cos\omega_{if}t\right) = \frac{1}{2}A^2\cos(2\omega_{lo}-\omega_{if})t + \frac{1}{2}A^2\cos\omega_{if}t \qquad (2.8)$$

Comparing (2.8) with (2.5) one can see that both of their second terms are at $\omega_{if}$
and  so  the  downconverted  image  does  indeed  lie  on  top  of  the  downconverted 
desired  signal.  Thus  if  there  is  a  strong  interference  at  the  image  frequency,  then  at 
the  IF  there  will  be  a  strong  interference  sitting  on  top  of  the  desired  signal, 
resulting  in  a  serious  degradation  of  SNR.  Notice  that  no  amount  of  filtering  can 
help  the  situation  after  the  downconversion  process.  Thus,  we  have  to  ensure  that 
the  image  is  filtered  before  downconversion. 
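
The overlap of the desired signal and its image at the IF can be checked with a few lines of arithmetic. The sketch below is illustrative only: it models the mixer as an ideal multiplier, as in (2.3), and the carrier and IF values (1.9 GHz and 100 MHz) as well as the helper name `mixer_outputs` are assumptions chosen for the example.

```python
def mixer_outputs(f_in_hz, f_lo_hz):
    """Sum and difference frequencies produced by an ideal multiplying mixer.

    cos(a)cos(b) = 0.5*[cos(a+b) + cos(a-b)], and cos is even, so the
    difference term appears at |f_in - f_lo|.
    """
    return f_in_hz + f_lo_hz, abs(f_in_hz - f_lo_hz)


# Assumed example values in Hz (integers keep the arithmetic exact).
f_rf = 1_900_000_000            # desired signal
f_if = 100_000_000              # chosen intermediate frequency
f_lo = f_rf - f_if              # rearranged from (2.2)
f_image = f_lo - f_if           # from (2.7)

_, desired_at_if = mixer_outputs(f_rf, f_lo)
_, image_at_if = mixer_outputs(f_image, f_lo)

print("desired ->", desired_at_if, "Hz")
print("image   ->", image_at_if, "Hz")
```

Both the desired signal and the image land on the same difference frequency, which is why the image has to be removed before the mixer rather than after it.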

When would a strong interference be present at the image frequency? Since the
image frequency is only $2\omega_{if}$ from the desired signal, if $\omega_{if}$ is small (low IF), then
this interference can come from adjacent channels or transmit bands of the same
standard (for standards using FDD). If $\omega_{if}$ is large, this interference can come from
transmit bands of other wireless standards.

To get rid of this image, we should place BPF2 in front of the mixer. The center
frequency of BPF2, $\omega_{center\_BPF2}$, is typically around the LO frequency. If BPF2 has
a bandwidth as shown in Figure 2.3, then the image will be filtered out. How do we
determine the bandwidth of this filter?

The answer depends on whether we have a low IF or high IF front end, as
the frequencies of the interferers are different. As an example we will treat the low
IF case. In this case, the interference comes primarily from an adjacent channel.
Let us redraw Figure 2.3 in Figure 2.4, but including adjacent channels. Figure 2.4
is drawn with 10 channels and illustrates the case for the DECT standard. Here
different users are separated using FDMA techniques. To maintain the same IF when
the desired channel frequency changes, we can change the LO frequency accordingly.
We stated that $\omega_{center\_BPF2}$ is approximately the same as the LO frequency. One way
to implement BPF2 is to make $\omega_{center\_BPF2}$ exactly the same as the LO frequency.
However, since the LO frequency changes, this arrangement necessitates
that $\omega_{center\_BPF2}$ be variable, which is impractical because implementing bandpass
filters with a variable center frequency at high frequency is very difficult. A better
alternative (which is our choice here) is to let $\omega_{center\_BPF2}$ remain fixed.

Fig. 2.4    Diagram showing problem of image and function of BPF2 (panels plot frequency response versus $\omega$; labels include channels 1 to 10, the band, and "Channel 10 + Channel 6")

Next we determine the bandwidth of a BPF2 that has a fixed center frequency.
Figure 2.4 shows that the 10 channels in DECT are frequency
multiplexed over a band that spans from 1880 to 1900 MHz. First let us try to
select channel 1. We do this by setting the LO frequency to $\omega_{lo\_1}$.
Now let us arbitrarily choose

$$\omega_{if} = 2 \times \omega_{channel} \qquad (2.9)$$

where $\omega_{channel}$ is the channel frequency, that is, the width of one channel.
Substituting (2.9) in (2.7), we have

$$\omega_{image} = \omega_{lo\_1} - \omega_{if} = \omega_{lo\_1} - 2 \times \omega_{channel} \qquad (2.10)$$

From Figure 2.4 there is no interference at that frequency and we do not have a
problem.

Next we try to select some higher-numbered channels. To do this we increase the
LO frequency, and the image frequency (at $\omega_{lo} - \omega_{if}$) starts to increase and eventually
may move into the band. At that point some other channels of the band will become
interferers. This is what we call adjacent channel interference. For example,
referring to Figure 2.4, we select the tenth channel by setting the LO frequency to
$\omega_{lo\_10}$; hence

$$\omega_{rf} = \omega_{10\text{-th-channel}} \qquad (2.11)$$

Substituting (2.11) and (2.9) into (2.1), the image will now be at a frequency
given by

$$\omega_{image} = \omega_{10\text{-th-channel}} - 4 \times \omega_{channel} = \omega_{6\text{-th-channel}} \qquad (2.12)$$

Hence the sixth channel would be downconverted to the same IF as
the tenth channel. To get rid of this image we should set BPF2's bandwidth to be
less than $4 \times \omega_{channel}$. This means that BPF2 has a center frequency
$\omega_{center\_BPF2} = \omega_{8\text{-th-channel}}$ and spans from $\omega_{6\text{-th-channel}}$ upward. The lower cutoff frequency is then
$\omega_{6\text{-th-channel}}$. However, this creates a problem. For example, if the user
assigned this tenth channel is later reassigned to channel 5, then $\omega_{rf} = \omega_{5\text{-th-channel}}$.
This is below $\omega_{6\text{-th-channel}}$ and the user's channel will be cut off. We can avoid this
problem by making sure that under the worst case, as the LO frequency varies, the
resulting image frequency never falls inside the band.
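
The channel bookkeeping in this example can be sketched in code. The snippet below is a minimal illustration, assuming equally spaced channels numbered 1 to 10 and an IF expressed as a multiple of the channel spacing; the function names are hypothetical and not from the text.

```python
def image_channel(desired_channel, if_in_channels):
    """Channel number on which the image falls, from (2.1).

    Assumes equally spaced channels, with the IF expressed as a
    multiple of the channel spacing.
    """
    return desired_channel - 2 * if_in_channels


def image_is_in_band(desired_channel, if_in_channels, first=1, last=10):
    """True if the image of the desired channel lands on a channel in the band."""
    return first <= image_channel(desired_channel, if_in_channels) <= last


IF_IN_CHANNELS = 2  # the arbitrary choice of (2.9)
for channel in range(1, 11):
    print(channel,
          image_channel(channel, IF_IN_CHANNELS),
          image_is_in_band(channel, IF_IN_CHANNELS))
```

With this choice of IF, selecting channel 10 puts the image on channel 6, as in (2.12), while the image of channel 1 falls below the band.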

What is the worst case? Referring to Figure 2.4, the worst case happens when the
desired channel is the highest-numbered channel, channel 10. Under this worst
case we have

$$\omega_{rf} = \omega_{10\text{-th-channel}} \qquad (2.13)$$

We want to place the desired signal's frequency at the upper cutoff frequency of
BPF2, so the upper cutoff frequency of BPF2 is given by

$$\omega_{upper\_cutoff} = \omega_{10\text{-th-channel}} \qquad (2.14)$$

The lower cutoff frequency of BPF2, $\omega_{lower\_cutoff}$, should be larger than
the image frequency:

$$\omega_{lower\_cutoff} > \omega_{image} \qquad (2.15)$$

However, in order not to cut off any channel, this lower cutoff frequency also has
to satisfy

$$\omega_{lower\_cutoff} \le \omega_{1\text{-th-channel}} \qquad (2.16)$$

As a minimum condition we set

$$\omega_{lower\_cutoff} = \omega_{1\text{-th-channel}} \qquad (2.17)$$

Meanwhile, the bandwidth of BPF2, $\omega_{bw\_BPF2}$, is given by definition as

$$\omega_{bw\_BPF2} = \omega_{upper\_cutoff} - \omega_{lower\_cutoff} \qquad (2.18)$$

Then, if we substitute (2.14) and (2.17) in (2.18), we will have

$$\omega_{bw\_BPF2} = \omega_{10\text{-th-channel}} - \omega_{1\text{-th-channel}} = \omega_{band} \qquad (2.19)$$

Here $\omega_{band}$ is the bandwidth of the band.

For DECT, $\omega_{band}$ is around $2\pi \times 20$ Mrad/s and $\omega_{rf}$ is around $2\pi \times 1.9$ Grad/s.
We assume that $\omega_{lo}$ is close to $\omega_{rf}$, that is, close to $2\pi \times 1.9$ Grad/s. Hence $\omega_{center\_BPF2}$ is
close to $2\pi \times 1.9$ Grad/s.

Hence we need a filter centered close to $2\pi \times 1.9$ Grad/s with a $2\pi \times 20$
Mrad/s bandwidth. The amount of suppression must be large enough that in the
presence of the strongest adjacent channel interference, the resulting $SNR_{demod\_in}$ is
larger than its required value. With these specifications BPF2 is typically designed
using a surface acoustic wave (SAW) filter or ceramic filter. The output matching
network of the LNA can also be used to provide some of this filtering, as shown in
Chapter 3.


2.3.3     Channel  Filter  (BPF3) 

First we want to find the center frequency of BPF3, which is the same as the IF.
To find the IF we proceed as follows: from (2.15), $\omega_{image} < \omega_{lower\_cutoff}$. As a minimum
condition we have

$$\omega_{image} = \omega_{lower\_cutoff} \qquad (2.20)$$

(the filter barely suppresses the image). Now we apply the worst case scenario as
described in equations (2.13) through (2.19). Substituting (2.17) in (2.20), we have

$$\omega_{image} = \omega_{1\text{-st-channel}} \qquad (2.21)$$

Substituting (2.13) and (2.21) into (2.1), we finally have

$$\omega_{10\text{-th-channel}} - \omega_{1\text{-st-channel}} = 2\omega_{if} \quad \text{or} \quad \omega_{if} = (\omega_{10\text{-th-channel}} - \omega_{1\text{-st-channel}})/2 = \omega_{band}/2 \qquad (2.22)$$

As an example, for DECT, using (2.22) the IF should be larger than $2\pi \times 10$
Mrad/s. However, an IF this close to the bound would put a stringent requirement on the rolloff of BPF2.
Hence the IF is usually increased substantially beyond this bound. In the present
case we set the IF to $2\pi \times 100$ Mrad/s:

$$\omega_{if} = 2\pi \times 100 \text{ Mrad/s} \qquad (2.23)$$

Therefore, the center frequency of BPF3, $\omega_{center\_BPF3}$, is

$$\omega_{center\_BPF3} = 2\pi \times 100 \text{ Mrad/s} \qquad (2.24)$$

Next we want to find the bandwidth of BPF3. Since at this point we are at a low enough
frequency to do channel filtering without fearing an unrealizable Q, we will
attempt to do so. Hence the bandwidth of BPF3 is simply the channel bandwidth, or

$$\omega_{bw\_BPF3} = \omega_{channel} = 2\pi \times 1.728 \text{ Mrad/s} \qquad (2.25)$$

For the Q required [for the DECT standard and our choice of intermediate frequency
(IF), we have $Q = (2\pi \times 100 \text{ Mrad/s})/(2\pi \times 1.728 \text{ Mrad/s}) \approx 58$] and the center
frequency specified ($2\pi \times 100$ Mrad/s), this filter may be designed as an active
filter (continuous time filter) or ceramic filter. The output network of the mixer can
also provide some channel filtering.

2.3.3.1     Trade-Off Between BPF2 and BPF3

We noted that the choice of IF affects both BPF2 and BPF3. We discuss
this in a bit more detail here. First we eliminate the $\omega_{band}$ term between (2.19) and
(2.22) and show that

$$\omega_{bw\_BPF2} = 2\omega_{if} \qquad (2.26)$$

Hence if $\omega_{if}$ increases then $\omega_{bw\_BPF2}$ also becomes larger. Consequently, $Q_{BPF2}$,
which is given roughly by $\omega_{lo}/\omega_{bw\_BPF2}$, becomes smaller and the design of BPF2
becomes more relaxed. Meanwhile, let us look at $Q_{BPF3}$, which is given roughly by
$\omega_{if}/\omega_{bw\_BPF3}$. Substituting (2.25) in this expression, we have $Q_{BPF3} = \omega_{if}/\omega_{channel}$.
Hence a larger $\omega_{if}$ means that $Q_{BPF3}$ becomes larger, making BPF3 more difficult
to design.
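
The trade-off can be tabulated numerically. The following sketch simply evaluates the two rough Q expressions above for a few candidate IFs; the LO frequency of 1.9 GHz and the list of IF values are assumptions made for illustration, and the function names are hypothetical.

```python
def q_bpf2(f_lo_hz, f_if_hz):
    """Rough Q of BPF2, taking its bandwidth as 2 * IF from (2.26)."""
    return f_lo_hz / (2.0 * f_if_hz)


def q_bpf3(f_if_hz, f_channel_hz):
    """Rough Q of BPF3: IF divided by the channel bandwidth, using (2.25)."""
    return f_if_hz / f_channel_hz


F_LO_HZ = 1.9e9          # assumed: LO taken close to the DECT carrier
F_CHANNEL_HZ = 1.728e6   # DECT channel bandwidth from (2.25)

for f_if_hz in (10e6, 50e6, 100e6):
    print(f"IF = {f_if_hz / 1e6:5.0f} MHz  "
          f"Q_BPF2 ~ {q_bpf2(F_LO_HZ, f_if_hz):6.1f}  "
          f"Q_BPF3 ~ {q_bpf3(f_if_hz, F_CHANNEL_HZ):5.1f}")
```

Raising the IF lowers the Q demanded of BPF2 and raises the Q demanded of BPF3, so the choice of IF moves difficulty from one filter to the other.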


2.4    Rest of Receiver Front End: Nonidealities and Design Parameters

Now that we have talked about the design of filters in the receiver front end, we turn our
attention to the design of the rest of the components. Normally these components
consist of circuits such as the LNA, mixer, IF amplifier, and analog/digital (A/D)
converter. These circuits are characterized by design parameters different from those of filters. Hence our
first task is to discuss these design parameters.

We start by noting that these design parameters are intimately related to the
nonidealities in these circuits. First, these circuits are nonlinear. The primary effect
of this nonlinearity is that it can frequency translate interference at frequencies
outside the channel frequency onto the channel frequency, thereby corrupting the
desired signal. Since this interference behaves like noise, it degrades the SNR
at the output of the front end. Second, these circuits introduce noise of their own.
Again, this degrades the SNR at the front end's output. In the following subsections
we translate the effects of these two nonidealities into the corresponding
design parameters.


2.4.1     Nonlinearity 

In this subsection we limit our analysis to nonlinearities up to the third order, because
normally these are the nonlinearities of most interest in a radio environment.
For simplicity we further assume that these nonlinearities are memoryless. Mathematically,
memoryless third-order nonlinearities are specified by a third-order polynomial,
which characterizes the input-output relation of a memoryless nonlinear system:

$$y(t) = \alpha_1 s(t) + \alpha_2 s^2(t) + \alpha_3 s^3(t) \qquad (2.27)$$

Here $s(t)$ is the input signal and $y(t)$ is the output signal. Because of this
nonlinearity, distortion is generated.


2.4.1.1     Harmonic  Distortion 

Using the input-output relation (2.27) with a single tone at the input, $s(t) = A\cos\omega_0 t$,
the output of the nonlinear system can be written as

$$y(t) = \alpha_1 A\cos\omega_0 t + \alpha_2 A^2\cos^2\omega_0 t + \alpha_3 A^3\cos^3\omega_0 t = \frac{\alpha_2 A^2}{2} + \left(\alpha_1 A + \frac{3\alpha_3 A^3}{4}\right)\cos\omega_0 t + \frac{\alpha_2 A^2}{2}\cos 2\omega_0 t + \frac{\alpha_3 A^3}{4}\cos 3\omega_0 t \qquad (2.28)$$

Harmonic distortion is defined as the ratio of the amplitude of a particular
harmonic to the amplitude of the fundamental. For example, third-order harmonic
distortion (HD3) is defined as the ratio of the amplitude of the tone at $3\omega_0$ to the
amplitude of the fundamental at $\omega_0$. Applying this definition to (2.28) and assuming
$\alpha_1 A \gg (3\alpha_3 A^3)/4$, we have

$$HD_3 = \frac{1}{4}\frac{\alpha_3}{\alpha_1}A^2 \qquad (2.29)$$

Fig. 2.5    Frequency spectrum of input, output of the nonlinear system

Next we take the Fourier transform of (2.28):

$$Y(\omega) = \alpha_2 A^2\pi\delta(\omega) + \pi\left(\alpha_1 A + \frac{3\alpha_3 A^3}{4}\right)[\delta(\omega-\omega_0) + \delta(\omega+\omega_0)] + \pi\frac{\alpha_2 A^2}{2}[\delta(\omega-2\omega_0) + \delta(\omega+2\omega_0)] + \pi\frac{\alpha_3 A^3}{4}[\delta(\omega-3\omega_0) + \delta(\omega+3\omega_0)] \qquad (2.30)$$

Equation (2.30) is plotted in Figure 2.5.

Harmonic distortion is typically not of major concern. As an example, for DECT,
$\omega_0 = 2\pi \times 1.9$ Grad/s. Suppose that the LNA in Figure 2.2 is nonlinear and generates
second harmonic distortion. This distortion is at $2\omega_0$, or $2\pi \times 3.8$ Grad/s, and will be
filtered by BPF2, hence posing no harm.
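
The harmonic amplitudes in (2.28) and the HD3 expression in (2.29) can be checked numerically. The sketch below is a minimal illustration under assumed values: the coefficients `ALPHAS` and the amplitude `A` are arbitrary example numbers, not taken from any circuit in the text, and the harmonics are extracted by projecting one period of the output onto $\cos k\theta$.

```python
import math


def harmonic_amplitude(alphas, amplitude, harmonic, samples=4096):
    """Amplitude of cos(harmonic*theta) in y, for harmonic >= 1.

    y is the memoryless polynomial of (2.27) driven by amplitude*cos(theta);
    the amplitude is found by numerical projection over one period.
    """
    a1, a2, a3 = alphas
    total = 0.0
    for n in range(samples):
        theta = 2.0 * math.pi * n / samples
        s = amplitude * math.cos(theta)
        y = a1 * s + a2 * s**2 + a3 * s**3
        total += y * math.cos(harmonic * theta)
    return 2.0 * total / samples


ALPHAS = (10.0, 0.5, -0.2)   # assumed (alpha1, alpha2, alpha3)
A = 0.1                      # assumed input amplitude

fundamental = harmonic_amplitude(ALPHAS, A, 1)
third = harmonic_amplitude(ALPHAS, A, 3)
hd3_numeric = abs(third / fundamental)
hd3_formula = 0.25 * abs(ALPHAS[2] / ALPHAS[0]) * A**2   # magnitude of (2.29)

print("HD3 numeric:", hd3_numeric)
print("HD3 formula:", hd3_formula)
```

The two values agree closely as long as the linear term dominates the fundamental, which is the assumption behind (2.29).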


2.4.1.2     Intermodulation


Intermodulation arises when more than one tone is present at the input. A common
method for analyzing this distortion is the "two-tone" test. We assume that two
strong interferers occur at the input of the receiver, specified by
$s(t) = A_1\cos\omega_1 t + A_2\cos\omega_2 t$. Again, the intermodulation distortion can be expressed mathematically
by applying $s(t)$ to (2.27):

$$y(t) = \alpha_1(A_1\cos\omega_1 t + A_2\cos\omega_2 t) + \alpha_2(A_1\cos\omega_1 t + A_2\cos\omega_2 t)^2 + \alpha_3(A_1\cos\omega_1 t + A_2\cos\omega_2 t)^3 \qquad (2.31)$$

Using trigonometric manipulations, we can find expressions for the second- and
third-order intermodulation products as follows:

$$\begin{aligned}
\omega_1 \pm \omega_2 &: \ \alpha_2 A_1 A_2\cos(\omega_1+\omega_2)t + \alpha_2 A_1 A_2\cos(\omega_1-\omega_2)t \\
2\omega_1 \pm \omega_2 &: \ \frac{3\alpha_3 A_1^2 A_2}{4}\cos(2\omega_1+\omega_2)t + \frac{3\alpha_3 A_1^2 A_2}{4}\cos(2\omega_1-\omega_2)t \\
2\omega_2 \pm \omega_1 &: \ \frac{3\alpha_3 A_2^2 A_1}{4}\cos(2\omega_2+\omega_1)t + \frac{3\alpha_3 A_2^2 A_1}{4}\cos(2\omega_2-\omega_1)t
\end{aligned} \qquad (2.32)$$

Fig. 2.6    The effect of the intermodulation distortion in the frequency domain, where the
frequencies represent the following signals: $\omega_0$: desired signal; $\omega_1$, $\omega_2$: strong interferers; $2\omega_1$,
$2\omega_2$: harmonics of the interferers; $\omega_1 \pm \omega_2$: second order intermodulation products; $2\omega_{1,2} \pm \omega_{2,1}$:
third order intermodulation products

The output spectrum in the frequency domain can be determined from (2.32) by
evaluating its Fourier transform $Y(\omega)$. This is shown in Figure 2.6.

It can be seen from Figure 2.6 that the intermodulation product with frequency
$2\omega_2 - \omega_1$ (denoted as the third order intermodulation product, ID3) lies at $\omega_0$ and
corrupts the desired signal at $\omega_0$. Furthermore, $\omega_1$ and $\omega_2$ are close to $\omega_0$, so trying to
filter them out requires a filter bandwidth that is impractically narrow.
Hence keeping the product at $2\omega_2 - \omega_1$ down by keeping the nonlinearity (which generates it in
the first place) small is the only solution.

Where do the two tones, at $\omega_1$ and $\omega_2$, come from? They can be any of the
interferences described in sub-section 2.2.1. Strictly speaking, the interferers are not
tones but are more like narrowband noise. For simplicity, for the time being we
represent the interferer in the band from $\omega_1 - \omega_{channel}/2$ to $\omega_1 + \omega_{channel}/2$ as a single
tone centered at $\omega_1$, with the amplitude of the tone, $A_1$, set equal to the rms value of the
narrowband noise. A similar representation is applied to the desired signal in
the band from $\omega_0 - \omega_{channel}/2$ to $\omega_0 + \omega_{channel}/2$ and the interferer in the band from
$\omega_2 - \omega_{channel}/2$ to $\omega_2 + \omega_{channel}/2$. To quantify this distortion we first define the third-order
intermodulation distortion, IM3, as the ratio of the amplitude of the ID3 to the
amplitude of the fundamental output component (denoted as ID1) of a linear system
given by $y(t) = \alpha_1 A\cos\omega_0 t$, where $\alpha_1$ is the linear small signal gain. Mathematically,
this is written as

$$IM_3 = \frac{I_{D3}}{I_{D1}} \qquad (2.33)$$


Note that IM3 expressed in decibels is simply the difference between the interferer's
fundamental output's signal strength in decibels (at $\omega_0$) and the interferer's
intermodulated product's strength in decibels (at $2\omega_{2,1} - \omega_{1,2}$).

In order to quantify IM3, let us simplify by assuming $A = A_1 = A_2$. Applying
(2.31) and (2.32) to (2.33) we get

$$IM_3 = \frac{\frac{3}{4}\alpha_3 A^3}{\alpha_1 A} = \frac{3}{4}\frac{\alpha_3}{\alpha_1}A^2 \qquad (2.34)$$

Fig. 2.7    a) The linear gain ($\alpha_1 A$) and the nonlinear component ($\frac{3}{4}\alpha_3 A^3$), b) Graphical
representation of the input and output third-order intercept point (IIP3, OIP3)

Comparing (2.29) to (2.34), it is seen that

$$IM_3 = 3HD_3 \qquad (2.35)$$
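
A two-tone test is easy to reproduce numerically. The sketch below is illustrative only: the polynomial coefficients, the amplitude, and the tone positions (10 and 11 cycles per analysis window, chosen so that no two mixing products coincide) are assumptions. It extracts the product at $2\omega_2 - \omega_1$ and compares the resulting IM3 with three times the HD3 of (2.29).

```python
import math


def two_tone_output(alphas, amplitude, k1, k2, samples=1024):
    """Output of the polynomial (2.27) for two equal-amplitude tones.

    k1 and k2 are integer numbers of cycles per analysis window.
    """
    a1, a2, a3 = alphas
    out = []
    for n in range(samples):
        theta = 2.0 * math.pi * n / samples
        s = amplitude * (math.cos(k1 * theta) + math.cos(k2 * theta))
        out.append(a1 * s + a2 * s**2 + a3 * s**3)
    return out


def tone_amplitude(signal, k):
    """Amplitude of the cosine with k cycles per window (k >= 1)."""
    n = len(signal)
    return 2.0 / n * sum(v * math.cos(2.0 * math.pi * k * i / n)
                         for i, v in enumerate(signal))


ALPHAS = (10.0, 0.5, -0.2)   # assumed (alpha1, alpha2, alpha3)
A = 0.1                      # assumed amplitude of each tone
K1, K2 = 10, 11              # assumed tone positions

y = two_tone_output(ALPHAS, A, K1, K2)
id3 = tone_amplitude(y, 2 * K2 - K1)                 # product at 2*w2 - w1
im3 = abs(id3) / (ALPHAS[0] * A)                     # magnitude of (2.34)
hd3 = 0.25 * abs(ALPHAS[2] / ALPHAS[0]) * A**2       # magnitude of (2.29)

print("ID3 amplitude:", id3)
print("IM3:", im3, " 3*HD3:", 3 * hd3)
```

Note that IM3 here is referred to the linear output $\alpha_1 A$, as in the definition preceding (2.33), not to the slightly compressed fundamental that the polynomial actually produces.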


Since  IM3  depends  on  input  level  and  is  sometimes  not  as  easy  to  use,  we  define 
another  performance  metric,  called  the  third  order  intercept  point  (IP3). 


2.4.1.3     Third-Order  Intercept  Point,  IP3 

From (2.31), we note that as the input level $A$ increases, the desired signal at the
output is proportional to $A$ (by the small signal gain $\alpha_1$). On the other hand, from
(2.32) we can see that the third-order product ID3 increases in proportion to $A^3$.
This is plotted on a linear scale in Figure 2.7a. Figure 2.7a is replotted on a
logarithmic scale in Figure 2.7b, where power level is used instead of amplitude
level. As shown in Figure 2.7b, the power of ID3 grows at three times the rate at
which the desired signal ID1 increases. The third-order intercept point IP3 is defined
to be the intersection of the two lines.

From Figure 2.7b we can see that the amplitude (in voltage) of the input
interferer at the third-order intercept point, $A_{IP3}$, is defined by the relation

$$20\log(\alpha_1 A_{IP3}) = 20\log\left(\frac{3}{4}\alpha_3 A_{IP3}^3\right) \qquad (2.36)$$

From (2.36) we can solve for $A_{IP3}$:

$$A_{IP3} = \sqrt{\frac{4}{3}\left|\frac{\alpha_1}{\alpha_3}\right|} \qquad (2.37)$$

For a 50 Ω load, we define the input third-order intercept point (IIP3) as
$IIP_3 = A_{IP3}^2/50\ \Omega$. (IIP3 is hence interpreted as the power level of the input interferer
for a 50 Ω load at the third-order intercept point.) Notice that IIP3 can be expressed
in terms of absolute value or decibels. One useful equation that relates IIP3 to IM3,
expressed in decibels, is the following [2]:

$$IIP_3|_{dBm} = P_i|_{dBm} - \frac{IM_3|_{dB}}{2} \qquad (2.38)$$

Here $P_i$ is the power level of the input interferer and is typically defined for a 50 Ω
load. Both IIP3 and $P_i$ are expressed in dBm, whereas IM3 is expressed in dB.
With IM3 defined as the ratio in (2.33), its value in dB is negative whenever the
intermodulation product is weaker than the fundamental, so IIP3 lies above $P_i$.

Fig. 2.8    Two cascaded nonlinear stages, where $a_{1,1}$ and $a_{1,2}$ are the linear gains of stages 1 and
2, $IIP_{3,1}$ and $IIP_{3,2}$ denote the third-order intercept points of stages 1 and 2, $z_1(t)$ denotes the
output signal of the first stage, and $z_2(t)$ denotes the output signal of the second stage
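
As a numerical illustration, the intercept amplitude and the decibel relation between IIP3, $P_i$, and IM3 can be evaluated directly. The sketch below assumes example coefficients and an example input amplitude, uses the text's convention that power is amplitude squared divided by 50 Ω, and takes IM3 as the ratio of the intermodulation product to the linear fundamental, so that IM3 in dB is negative for small inputs. The function names are hypothetical.

```python
import math


def a_ip3(alpha1, alpha3):
    """Input amplitude at the third-order intercept point, from (2.36)."""
    return math.sqrt(4.0 / 3.0 * abs(alpha1 / alpha3))


def power_dbm(amplitude_v, r_ohm=50.0):
    """Power in dBm, using the text's convention P = A^2 / R."""
    return 10.0 * math.log10(amplitude_v**2 / r_ohm / 1e-3)


def im3_db(alpha1, alpha3, amplitude_v):
    """IM3 of (2.34) as an amplitude ratio expressed in dB."""
    return 20.0 * math.log10(0.75 * abs(alpha3 / alpha1) * amplitude_v**2)


ALPHA1, ALPHA3 = 10.0, -0.2   # assumed example coefficients
A_IN = 0.1                    # assumed interferer amplitude in volts

iip3_dbm = power_dbm(a_ip3(ALPHA1, ALPHA3))
p_i_dbm = power_dbm(A_IN)
im3 = im3_db(ALPHA1, ALPHA3, A_IN)

print("IIP3 [dBm]:", iip3_dbm)
print("P_i - IM3/2 [dBm]:", p_i_dbm - im3 / 2.0)
```

The two printed values coincide, which follows from (2.34) because IM3 equals the squared ratio of the input amplitude to $A_{IP3}$.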


2.4.1.4     Cascaded  Nonlinear  Stages 

Next  we  investigate  the  impact  of  the  nonlinearity  of  each  stage  on  the  overall 
nonlinearity  performance  of  the  front  end.  To  get  a  useful  design  relation,  we  want 
to  find  the  overall  third-order  intercept  point  at  the  input  in  terms  of  the  IIP3  and  the 
gain  of  each  stage.  As  a  start,  let  us  assume  that  we  have  a  cascade  of  two  nonlinear 
stages,  as  shown  in  Figure  2.8. 

Using the input-output relation (2.27), we can approximate the nonlinear behavior
of each subcomponent as follows:

$$z_1(t) = a_{1,1}s(t) + a_{2,1}s^2(t) + a_{3,1}s^3(t) \qquad (2.39)$$

$$z_2(t) = a_{1,2}z_1(t) + a_{2,2}z_1^2(t) + a_{3,2}z_1^3(t) \qquad (2.40)$$

Here $a_{i,j}$ means the $i$th-order gain of the $j$th stage. For example, $a_{3,2}$ is the $\alpha_3$ gain of
the second stage.

Now we can write the output of the second stage $z_2(t)$ in terms of the input signal
$s(t)$ by applying (2.39) to (2.40):

$$\begin{aligned}
z_2(t) = {} & a_{1,2}\left(a_{1,1}s(t) + a_{2,1}s^2(t) + a_{3,1}s^3(t)\right) \\
& + a_{2,2}\left(a_{1,1}s(t) + a_{2,1}s^2(t) + a_{3,1}s^3(t)\right)^2 \\
& + a_{3,2}\left(a_{1,1}s(t) + a_{2,1}s^2(t) + a_{3,1}s^3(t)\right)^3
\end{aligned} \qquad (2.41)$$

To determine the third-order intercept point, we need to calculate the linear and
the third-order terms at the output of the second stage. From (2.41) it can be shown
that the linear term of $z_2(t)$ equals $a_{1,1}a_{1,2}s(t)$ and that the third-order term equals
$[a_{3,1}a_{1,2} + 2a_{1,1}a_{2,1}a_{2,2} + a_{1,1}^3 a_{3,2}]s^3(t)$.

Since (2.41) describes the overall input-output relation of the two-stage system,
we can now treat it as one single stage. We can then reuse the previous formulae
derived for a single stage to determine the $A_{IP3}$ of this two-stage system. This consists
of setting $\alpha_1 = a_{1,1}a_{1,2}$ and $\alpha_3 = a_{3,1}a_{1,2} + 2a_{1,1}a_{2,1}a_{2,2} + a_{1,1}^3 a_{3,2}$ in (2.37). Thus the
overall $A_{IP3}$ of the cascaded system can be expressed as

$$A_{IP3} = \sqrt{\frac{4}{3}\left|\frac{a_{1,1}a_{1,2}}{a_{3,1}a_{1,2} + 2a_{1,1}a_{2,1}a_{2,2} + a_{1,1}^3 a_{3,2}}\right|} \qquad (2.42)$$

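Unfortunately, the signs of the coefficients in the denominator of (2.42) are circuit
dependent. Considering the worst case, we add the absolute values together.
Rearranging, (2.42) becomes

$$\frac{1}{A_{IP3}^2} = \frac{3}{4}\,\frac{|a_{3,1}a_{1,2}| + |2a_{1,1}a_{2,1}a_{2,2}| + |a_{1,1}^3 a_{3,2}|}{|a_{1,1}a_{1,2}|} = \frac{3}{4}\left|\frac{a_{3,1}}{a_{1,1}}\right| + \frac{3}{2}\left|\frac{a_{2,1}a_{2,2}}{a_{1,2}}\right| + \frac{3}{4}a_{1,1}^2\left|\frac{a_{3,2}}{a_{1,2}}\right| \qquad (2.43)$$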


Remember, our goal is to express the overall $A_{IP3}$ in terms of the $A_{IP3}$ and the
linear gain of each stage. Comparing the first term of (2.43) with (2.37), it can be
seen that the first term gives the reciprocal of the square of $A_{IP3,1}$, the $A_{IP3}$ of the first
stage. Following the same reasoning, the third term is seen to consist of the product
of the reciprocal of the square of $A_{IP3,2}$, the $A_{IP3}$ of the second stage, and the square of
$a_{1,1}$, the linear gain of the first stage. Hence (2.43) can be rewritten as

$$\frac{1}{A_{IP3}^2} = \frac{1}{A_{IP3,1}^2} + \left|\frac{3a_{2,1}a_{2,2}}{2a_{1,2}}\right| + \frac{a_{1,1}^2}{A_{IP3,2}^2} \qquad (2.44)$$


We can make further approximations by examining the second term in (2.44).
The coefficients $a_{2,i}$ in the second term only contribute to the second-order intermodulation
products and harmonic distortion. As shown in Figure 2.6, since the front
end operates primarily in a narrow frequency band, the second-order intermodulation
products and the harmonic distortion terms lie outside the band of interest and
are thus strongly attenuated by the BPFs in the front end. Hence the impact of $a_{2,i}$ in
generating the overall front end's distortion component is small. Consequently, in
determining the overall front end's $A_{IP3}$, the second term can be neglected. Accordingly,
(2.44) can be further simplified as

$$\frac{1}{A_{IP3}^2} = \frac{1}{A_{IP3,1}^2} + \frac{a_{1,1}^2}{A_{IP3,2}^2} \qquad (2.45)$$

Fig. 2.9    N cascaded nonlinear stages, where $a_{1,1} \ldots a_{1,n}$ are the linear gains of stages 1 ... n,
$IIP_{3,1} \ldots IIP_{3,n}$ denote the third order intercept points of stages 1 ... n, $z_1(t)$ denotes the output signal
of the first stage, and $z_n(t)$ denotes the output signal of the nth stage


Equation (2.45) leads to a useful observation: when $a_{1,1}$ is large, (2.45) is
essentially saying that the overall $A_{IP3}$ is dominated by the $A_{IP3}$ of the second
stage. Physically, this is what happens: in the case where the linear gain $a_{1,1}$ of the
first stage is large, the input signal to the second stage is large. Thus, the distortion
of the second stage becomes more critical since it has to handle a larger input signal.

Next (2.45) can be extended to a cascade of an arbitrary number of nonlinear
stages, as shown in Figure 2.9, and the resulting overall input intercept point can
be derived to be

$$\frac{1}{A_{IP3}^2} = \frac{1}{A_{IP3,1}^2} + \frac{a_{1,1}^2}{A_{IP3,2}^2} + \frac{a_{1,1}^2 a_{1,2}^2}{A_{IP3,3}^2} + \cdots + \frac{a_{1,1}^2 a_{1,2}^2 \cdots a_{1,n-1}^2}{A_{IP3,n}^2} \qquad (2.46)$$

Equation (2.46) can also be restated in a form that involves IIP3. For a 50 Ω load
we stated that IIP3 equals $A_{IP3}^2/50$, and hence $IIP_{3,i}$ equals $A_{IP3,i}^2/50$, where $i$ is the
stage number. Substituting these relations in (2.46) we have

$$\frac{1}{IIP_3} = \frac{1}{IIP_{3,1}} + \frac{a_{1,1}^2}{IIP_{3,2}} + \frac{a_{1,1}^2 a_{1,2}^2}{IIP_{3,3}} + \cdots + \frac{a_{1,1}^2 a_{1,2}^2 \cdots a_{1,n-1}^2}{IIP_{3,n}} \qquad (2.47)$$


Under the assumption that the subcomponents are matched (in order to allow
maximum power transfer), $G_i$, the power gain of each stage, can be expressed in
terms of the voltage gain of each stage as

$G_1$ = power gain of the first stage = $\alpha_{1,1}^{2}$
$G_2$ = power gain of the second stage = $\alpha_{1,2}^{2}$

and in general $G_i = \alpha_{1,i}^{2}$, where i is the stage number. (As a side note, the exact
relation between the power gain and the voltage gain will be clarified in Problem 2.5.)
Substituting this in (2.47), we get


$$
\frac{1}{\mathrm{IIP}_{3}} = \frac{1}{\mathrm{IIP}_{3,1}} + \frac{G_{1}}{\mathrm{IIP}_{3,2}} + \frac{G_{1}G_{2}}{\mathrm{IIP}_{3,3}} + \cdots + \frac{G_{1}G_{2}\cdots G_{n-1}}{\mathrm{IIP}_{3,n}} \tag{2.48}
$$

Equation (2.48) is a useful approximation of the overall performance of the
system due to the nonlinear behavior of each stage. Essentially, it says that when
the stage gains are large, the overall input intercept point is dominated by the
distortion of the last stage.
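
As a rough numerical illustration of (2.48), the following sketch evaluates the cascade formula for a hypothetical three-stage chain. The function name `cascade_iip3_dbm` and the stage values are assumptions made for illustration only; they do not come from the text. Gains and intercept points are entered in dB and dBm and converted to linear units before (2.48) is applied.

```python
import math


def db_to_lin(x_db):
    """Convert a dB (or dBm) value to a linear ratio (or mW)."""
    return 10.0 ** (x_db / 10.0)


def cascade_iip3_dbm(gains_db, iip3_dbm):
    """Overall IIP3 in dBm of a cascade, following (2.48).

    gains_db[i] and iip3_dbm[i] are the power gain and the input
    intercept point of stage i+1. The gain of the last stage is not
    used, exactly as in (2.48).
    """
    inv_total = 0.0
    gain_before = 1.0  # product of the power gains preceding the current stage
    for g_db, ip_dbm in zip(gains_db, iip3_dbm):
        inv_total += gain_before / db_to_lin(ip_dbm)
        gain_before *= db_to_lin(g_db)
    return -10.0 * math.log10(inv_total)


# Hypothetical chain: illustrative numbers, not taken from the text.
gains = [15.0, 10.0, 20.0]   # power gains in dB
iip3s = [0.0, 5.0, 10.0]     # stage IIP3 values in dBm
print(f"Overall IIP3 = {cascade_iip3_dbm(gains, iip3s):.2f} dBm")
```

With these assumed numbers the overall IIP3 comes out well below that of any individual stage, because the later stages' intercept points are referred back to the input through the preceding gain.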

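Finally, the overall output intercept point, OIP3, can be expressed in terms of the
output intercept point (OIP3,i) and power gain ($G_i$) of each stage (i = 2, 3, ..., n),
where n = number of stages. First we start with (2.47) and divide both sides of
(2.47) by $\alpha_{1,1}^{2}\alpha_{1,2}^{2}\cdots\alpha_{1,n}^{2}$:

$$
\frac{1}{\alpha_{1,1}^{2}\cdots\alpha_{1,n}^{2}\,\mathrm{IIP}_{3}} = \frac{1}{\alpha_{1,1}^{2}\cdots\alpha_{1,n}^{2}\,\mathrm{IIP}_{3,1}} + \frac{1}{\alpha_{1,2}^{2}\cdots\alpha_{1,n}^{2}\,\mathrm{IIP}_{3,2}} + \cdots + \frac{1}{\alpha_{1,n}^{2}\,\mathrm{IIP}_{3,n}} \tag{2.49}
$$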

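Next we relate the output intercept point to the input intercept point. For the
output intercept point of each stage, OIP3,i, we have

$$
\mathrm{OIP}_{3,i} = \alpha_{1,i}^{2}\,\mathrm{IIP}_{3,i} \tag{2.50}
$$

where i = 1, ..., n. For the output intercept point of the overall front end, OIP3, we have

$$
\mathrm{OIP}_{3} = \text{total gain} \times \mathrm{IIP}_{3} = \alpha_{1,1}^{2}\cdots\alpha_{1,n}^{2}\,\mathrm{IIP}_{3} \tag{2.51}
$$

Substituting (2.50) and (2.51) into (2.49), we get

$$
\frac{1}{\mathrm{OIP}_{3}} = \frac{1}{\alpha_{1,2}^{2}\cdots\alpha_{1,n}^{2}\,\mathrm{OIP}_{3,1}} + \frac{1}{\alpha_{1,3}^{2}\cdots\alpha_{1,n}^{2}\,\mathrm{OIP}_{3,2}} + \cdots + \frac{1}{\mathrm{OIP}_{3,n}} \tag{2.52}
$$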


Again,  under  the  assumption  that  the  subcomponents  are  matched  (in  order  to 
allow  maximum  power  transfer),  (2.52)  can  be  expressed  in  terms  of  the  power 
gains  of  each  stage: 

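$$
\frac{1}{\mathrm{OIP}_{3}} = \frac{1}{G_{2}\cdots G_{n}\,\mathrm{OIP}_{3,1}} + \frac{1}{G_{3}\cdots G_{n}\,\mathrm{OIP}_{3,2}} + \cdots + \frac{1}{\mathrm{OIP}_{3,n}} \tag{2.53}
$$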

The  importance  of  (2.53)  is  that  it  tells  us  that  the  output  intercept  point  of  the 
overall  front  end  is  dominated  by  the  output  intercept  point  of  the  last  stage. 
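
The output-referred form (2.53) can be checked numerically in the same spirit. The sketch below is a minimal illustration under assumed values; the function name `cascade_oip3_dbm` and the stage numbers are hypothetical. Each stage's OIP3 is referred to the output of the chain by the gain that follows it.

```python
import math


def cascade_oip3_dbm(gains_db, oip3_dbm):
    """Overall OIP3 in dBm of a cascade, following (2.53).

    gains_db[i] and oip3_dbm[i] are the power gain (dB) and the output
    intercept point (dBm) of stage i+1.
    """
    inv_total = 0.0
    for i, op_dbm in enumerate(oip3_dbm):
        gain_after_db = sum(gains_db[i + 1:])  # G_{i+1} ... G_n in dB
        inv_total += 1.0 / 10.0 ** ((gain_after_db + op_dbm) / 10.0)
    return -10.0 * math.log10(inv_total)


# Hypothetical chain: stage OIP3 = stage IIP3 + stage gain, as in (2.50).
gains = [15.0, 10.0, 20.0]      # dB
stage_iip3 = [0.0, 5.0, 10.0]   # dBm
stage_oip3 = [ip + g for ip, g in zip(stage_iip3, gains)]
print(f"Overall OIP3 = {cascade_oip3_dbm(gains, stage_oip3):.2f} dBm")
```

For these assumed values the last term, 1/OIP3,n, is the largest contributor to the sum, which is the behavior (2.53) predicts when the stage gains are large.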


2.4.1.5     Gain  Compression 

Another phenomenon caused by the nonlinearity of the receiver is called gain
compression. When the input to an amplifier is large, the amplifier saturates,
hence clipping the signal. When the strength of the input is further increased, the
output signal is no longer amplified. At this point, the output is said to be
compressed. We may now ask what the clipping of a signal has to do with the
nonlinear behavior of a system. If we go back to (2.28), we observe that in y(t) there
are two terms with frequency $\omega_0$ due to the nonlinear behavior. Let us assume that
the other terms in y(t) have frequencies outside the band of interest and hence are
removed by the BPFs. Thus, y(t) becomes


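$$
y(t) = \left(\alpha_{1}A + \frac{3\alpha_{3}A^{3}}{4}\right)\cos\omega_{0}t = \left(\alpha_{1} + \frac{3\alpha_{3}A^{2}}{4}\right)A\cos\omega_{0}t \tag{2.54}
$$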


In the case where $\alpha_3$ is negative, the second term decreases the gain. As the
input starts to increase, the impact of the second term becomes important in the
sense that it saturates the active device. To get a feeling for the input level when
considerable gain compression occurs, we can use the concept of the 1-dB
compression point, defined as the input level that causes the linear small-signal
gain to drop by 1 dB. Thus, $A_{1\text{-dB}}$ specifies the amplitude (in voltage) of the
input signal when the linear voltage gain drops by 1 dB. From (2.54) we see that the
1-dB compression point can be expressed mathematically by


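$$
\left|\alpha_{1} + \frac{3}{4}\alpha_{3}A_{1\text{-dB}}^{2}\right|_{\mathrm{dB}} = \left|\alpha_{1}\right|_{\mathrm{dB}} - 1\ \mathrm{dB} \tag{2.55}
$$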


We  can  rewrite  (2.55)  in  terms  of  decibels: 


$$
20\log\left|\alpha_{1} + \frac{3}{4}\alpha_{3}A_{1\text{-dB}}^{2}\right| = 20\log\left|\alpha_{1}\right| - 20\log 1.122 \tag{2.56}
$$


From (2.56) the $A_{1\text{-dB}}$ input level is given by

$$
A_{1\text{-dB}} = \sqrt{0.145\left|\frac{\alpha_{1}}{\alpha_{3}}\right|} \tag{2.57}
$$

The  idea  of  the  1-dB  compression  point  is  shown  graphically  in  Figure  2.10. 
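
To make (2.57) concrete, the sketch below computes the 1-dB compression amplitude for a hypothetical pair of coefficients and then checks, using the gain expression in (2.54), that the gain has indeed dropped by about 1 dB at that amplitude. The coefficient values and function names are assumptions chosen for illustration.

```python
import math


def a_1db(alpha1, alpha3):
    """Input amplitude (V) at the 1-dB compression point, per (2.57)."""
    return math.sqrt(0.145 * abs(alpha1 / alpha3))


def gain_db(alpha1, alpha3, amplitude):
    """Gain at the fundamental in dB, from the bracketed term in (2.54)."""
    return 20.0 * math.log10(abs(alpha1 + 0.75 * alpha3 * amplitude ** 2))


# Hypothetical coefficients; alpha3 < 0 gives compressive behavior.
alpha1, alpha3 = 10.0, -2.0
a_comp = a_1db(alpha1, alpha3)
small_signal_gain = gain_db(alpha1, alpha3, 1e-6)
gain_drop_db = small_signal_gain - gain_db(alpha1, alpha3, a_comp)
print(f"A_1dB = {a_comp:.3f} V, gain drop there = {gain_drop_db:.3f} dB")
```

The factor 0.145 in (2.57) is a rounded value, so the computed drop is very close to, but not exactly, 1 dB.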


2.4.1.6     Blocking 

Fig. 2.10 Illustration of the 1-dB compression point (vertical axis: 20 log(Aout))

A phenomenon closely related to gain compression is blocking. So far, the
compressive behavior of a nonlinear subcomponent with a single input signal has been
discussed. What happens if a weak desired signal along with a strong interferer
occurs at the input of a compressive subcomponent? Assume that we have the input
signal $s(t) = A_0\cos\omega_0 t + A_1\cos\omega_1 t$, where $A_1$ is the amplitude of a strong
interferer and $A_0$ is the amplitude of the desired signal. Applying this s(t) to (2.27)
we can express the output term of interest (the term at the fundamental frequency) as


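$$
y(t) = \left(\alpha_{1}A_{0} + \frac{3}{4}\alpha_{3}A_{0}^{3} + \frac{3}{2}\alpha_{3}A_{0}A_{1}^{2}\right)\cos\omega_{0}t + \cdots \tag{2.58}
$$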


If the interferer strength is much greater than the desired signal strength (that is,
$A_1 \gg A_0$), equation (2.58) can be simplified as

$$
y(t) \approx \left(\alpha_{1} + \frac{3}{2}\alpha_{3}A_{1}^{2}\right)A_{0}\cos\omega_{0}t \tag{2.59}
$$


If $\alpha_3$ is negative, the small-signal gain is attenuated by the interferer. If the
attenuation becomes so large that the overall gain drops to zero, we say that the
signal is blocked. Many receivers must be able to withstand blocking signals up to
70 dB greater than the desired signal.

To summarize, the effect of gain compression and blocking is that the desired
signal amplitude is reduced, and this results in a degraded SNRdemod_in.
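
The desensitization described by (2.59) can be sketched numerically as follows. The coefficients and function names are assumptions for illustration; the second function simply solves for the interferer amplitude at which the bracketed gain term in (2.59) reaches zero, that is, the point at which the signal is blocked.

```python
import math


def desensitized_gain(alpha1, alpha3, a_interferer):
    """Small-signal gain seen by the desired signal, per (2.59)."""
    return alpha1 + 1.5 * alpha3 * a_interferer ** 2


def blocking_amplitude(alpha1, alpha3):
    """Interferer amplitude at which the gain in (2.59) falls to zero.

    Assumes alpha1 and alpha3 have opposite signs (compressive device).
    """
    return math.sqrt(2.0 * abs(alpha1) / (3.0 * abs(alpha3)))


# Hypothetical coefficients, not taken from the text.
alpha1, alpha3 = 10.0, -2.0
for a1 in (0.0, 0.5, 1.0, 1.5):
    print(f"A1 = {a1:.1f} V -> gain = {desensitized_gain(alpha1, alpha3, a1):.3f}")
print(f"Gain reaches zero at A1 = {blocking_amplitude(alpha1, alpha3):.3f} V")
```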


2.4.2    Noise 


There  is  circuit  noise  internal  to  the  subcomponents  in  the  front  end.  This  noise  will 
add  on  to  the  AWGN,  cause  interference,  and  further  degrade  the  SNR.  To  explore 
this  effect  we  have  to  treat  the  circuit  noise  aspect  in  a  more  comprehensive  manner. 
First  we  talk  about  noise  sources.  Circuit  noise  is  associated  with  the  electrical 
components  that  build  the  subcomponents,  such  as  resistors  and  transistors.  Circuit 
noise is further subdivided into thermal, shot, and flicker noise.

2.4.2.1     Noise Sources

In  this  subsection  we  define  the  effects  of  the  circuit  noise.  The  noise  phenomena 
considered  here  are  caused  by  the  small  current  and  voltage  fluctuations  that  are 
generated  within  the  devices  themselves. 

Thermal  noise 

Thermal noise arises from the random, thermally generated motion of
electrons. It occurs in resistive devices and is proportional to the temperature.
Fundamentally, the thermal energy of electrons causes them to move randomly, thus causing
local concentrations of electrons. This net concentration of negative charge in a local
spot (balanced by a net concentration of positive charge in another spot, as the total
charge must remain zero) will result in a local nonzero voltage. As the concentration
changes randomly, the resulting voltage also changes randomly, resulting in a noiselike
behavior. Note that the noise exists even when the resistor is not connected and no
current is flowing through it (as opposed to shot noise, to be discussed in the next
subsection). It is white and has a flat power spectral density (PSD) whose value can be
given as follows (depending on whether it is modeled by its Thevenin or Norton
equivalent):

$$
\overline{v^{2}} = 4kTR \quad \text{or} \quad \overline{i^{2}} = \frac{4kT}{R} \tag{2.60}
$$


Here $\overline{v^{2}}$ and $\overline{i^{2}}$ are the mean square noise voltage and current, respectively; k is the
Boltzmann constant; T is the temperature in kelvin; and R is the resistance value.
The unit is V²/Hz or A²/Hz.
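
As a quick sanity check on the magnitude implied by the thermal noise density $4kTR$, the sketch below computes the RMS noise voltage of a resistor over a given bandwidth. The default temperature of 290 K and the function name are assumptions of this example; the text does not fix a temperature at this point.

```python
import math

K_BOLTZMANN = 1.380649e-23  # Boltzmann constant in J/K


def thermal_noise_vrms(resistance_ohm, bandwidth_hz, temperature_k=290.0):
    """RMS thermal noise voltage: sqrt(4 k T R B), Thevenin model."""
    return math.sqrt(4.0 * K_BOLTZMANN * temperature_k * resistance_ohm * bandwidth_hz)


# 50-ohm resistor in a 1 Hz bandwidth (assumed temperature: 290 K).
v_n = thermal_noise_vrms(50.0, 1.0)
print(f"{v_n * 1e9:.3f} nV per root-hertz")
```

For a 50 Ω resistor this gives a little under 1 nV per root-hertz, a useful reference figure when judging the noise of a front-end stage.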


Shot  noise 

Shot noise occurs in all energy barrier junctions, namely in diodes and bipolar
transistors. Actually, it happens whenever a flux of carriers (possessing potential
energy) passes over an energy boundary. Since the potential energy of the carriers is
random, the number of carriers that possess enough energy to cross the barrier is
random in nature, resulting in a flux (a current, if the carriers are electrons) whose
density is also random in nature. This random nature gives rise to shot noise. It
follows that shot noise exists only when there is a current, as opposed to
thermal noise. For example, a bipolar junction transistor (BJT) is a device whose
current I is composed of holes and electrons that have sufficient energy to overcome
the potential barrier at the junction. Thus, the current consists of discrete charges and
not of a continuum of charges. The fluctuation in the current I is termed shot noise.
This current I is composed of random pulses with average value $I_{DC}$. It can be
shown that shot noise is white with a flat PSD whose value is given as follows:


$$
\overline{i^{2}} = 2qI_{DC} \tag{2.61}
$$

Here q is the charge of an electron in coulombs and $I_{DC}$ is the value of the dc
current in amperes.
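
A minimal numerical sketch of (2.61) follows. The bias current of 1 mA and the function name are assumptions picked for illustration.

```python
import math

Q_ELECTRON = 1.602176634e-19  # elementary charge in coulombs


def shot_noise_irms(i_dc_amp, bandwidth_hz):
    """RMS shot noise current: sqrt(2 q I_DC B), from (2.61)."""
    return math.sqrt(2.0 * Q_ELECTRON * i_dc_amp * bandwidth_hz)


# Hypothetical bias current of 1 mA, 1 Hz bandwidth.
i_n = shot_noise_irms(1e-3, 1.0)
print(f"{i_n * 1e12:.2f} pA per root-hertz")
```

Because the density scales with $I_{DC}$, the RMS noise current grows only as the square root of the bias current.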


Flicker  noise 

Flicker noise or 1/f noise arises from random trapping of charge at the oxide-silicon
interface of MOS transistors and in some resistive devices. The more
current flows in a nonideal silicon interface, the higher is the rate at which electrons
are trapped. On the other hand, at high frequency the electrons are varying too
fast to be trapped and no noiselike current variation is caused. Consequently, the
noise density is given by

$$
\overline{i^{2}} = K\frac{I_{DC}^{a}}{f} \tag{2.62}
$$

where K and a (ranging from 0.5 to 2) are constants that depend on the nature of the
device, $I_{DC}$ is the dc current in amperes, and f is frequency in hertz. As we can see
from (2.62), flicker noise is most significant at low frequencies. However, it can still
be troublesome for frequencies up to a few megahertz.


Additive  Noise  Versus  Phase  Noise 

Now we discuss the manifestation of circuit noise in the front end. The noise of a
subcomponent manifests itself as either additive noise or phase noise. Additive
noise is noise that adds to the amplitude of the desired signal.
Phase noise is noise that adds to the phase of the desired signal. In a front end the
noise inherent in an LNA and a mixer is best described by additive noise, whereas
the noise inherent in a frequency synthesizer is best described by phase noise. Both
will degrade the SNRdemod_in and hence the sensitivity of the overall receiver chain.
In this chapter we will focus on the additive noise of the LNA and mixer blocks.
In Chapters 7 and 8 we return to the discussion of phase noise.

2.4.2.2     Noise  Figure 

Fig. 2.11    Model used in noise figure calculations

The noise figure (NF) is a commonly used parameter for specifying the
additive noise inherent in a circuit or system. Use of this parameter is limited to
situations where the source impedance is resistive. However, this is often the case in
a front end, and so this method of specifying noise is adopted here. The noise figure
describes how much the internal noise of an electronic element degrades its SNR.
It is often specified for a 1 Hz bandwidth at a given frequency. In this case, the noise
figure is also called the spot noise figure to emphasize the very small bandwidth, as
opposed to the average noise figure, where the band of interest is taken into account.
The interpretation should be clear from the context. Mathematically the noise
figure is defined as


$$
\mathrm{NF} = \frac{\mathrm{SNR}_{in}}{\mathrm{SNR}_{out}} = \frac{S_{in}N_{out}}{S_{out}N_{in}} \tag{2.63}
$$


where $N_{in}$ is the input noise power and is always taken as the noise in the source
resistance, and $N_{out}$ is the output noise power including the circuit contribution and
noise transmitted from the source resistance. By inserting $S_{out} = GS_{in}$ into (2.63),
where G is the power gain of the corresponding stage, we get

$$
\mathrm{NF} = \frac{N_{out}}{GN_{in}} \tag{2.64}
$$


Let us refer to Figure 2.11, the model for NF calculation. In Figure 2.11,

$$
N_{in} = N_{\mathrm{source\_resistance}} \tag{2.65}
$$


$N_{out}$, the noise occurring at the output, is given by the input noise multiplied by
the power gain G plus the additional device noise:

$$
N_{out} = N_{\mathrm{Device}} + GN_{\mathrm{source\_resistance}} \tag{2.66}
$$

Substituting (2.65) and (2.66) into (2.64), we have

$$
\mathrm{NF} = \frac{N_{\mathrm{Device}} + G \cdot N_{\mathrm{source\_resistance}}}{G \cdot N_{\mathrm{source\_resistance}}} \tag{2.67}
$$
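
The ratio in (2.67) is easy to evaluate directly. The sketch below computes it as a linear ratio and also expresses it in decibels as 10 log10 of that ratio; the function names and the numbers in the example are assumptions made for illustration.

```python
import math


def noise_factor(n_device, gain, n_source):
    """Linear noise ratio from (2.67); all noise powers in the same unit."""
    return (n_device + gain * n_source) / (gain * n_source)


def to_decibels(ratio):
    """Express a linear power ratio in dB."""
    return 10.0 * math.log10(ratio)


# Hypothetical stage: linear power gain of 10, and device noise at the
# output equal to the amplified source noise.
f_lin = noise_factor(n_device=10.0, gain=10.0, n_source=1.0)
print(f"ratio = {f_lin:.2f}, in decibels = {to_decibels(f_lin):.2f} dB")
```

A noiseless stage (zero device noise) gives a ratio of exactly 1, that is, 0 dB, which is the lower bound for any passive or active stage described by (2.67).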


As a final remark, note that the noise figure is specified by a power ratio and
given in decibels. We usually refer to the corresponding numerical ratio as the
noise factor. Thus, the relation between the two is given by noise figure = 10
log10(noise factor). Which one we are referring to should be clear from the context.

Fig. 2.12    Cascade of k gain stages, each with noise

2.4.2.3     Cascaded  Noisy  Stages 

The noise figure of a cascade of noisy stages can be derived in terms of the noise
figures of the individual blocks. Consider Figure 2.12, where a cascade of noisy
stages, each with available power gain $G_i$ and noise figure $NF_i$, is shown. $N_i$, $N_{in,i}$, and
$N_{out,i}$ specify the device noise, input noise, and output noise, respectively, of the ith
stage. Since we have a number of stages, we further make the assumption that we
have the same source impedance for each stage. This assures that we have the same
input noise $N_{in,i}$ occurring at the input of each stage. Referring to Figure 2.12, the
derivation consists of the following four steps:

1. We find the output signal power, which is given by the input signal power multiplied by
the product of the power gains of all stages.

$$
S_{out} = S_{in}(G_{1}G_{2}\cdots G_{k}) \tag{2.68}
$$


2. We want to relate the noise $N_i$ occurring in each stage to the corresponding noise
figure of that stage. We start by applying (2.64) to stage i and we have

$$
NF_{i} = \frac{S_{in,i}N_{out,i}}{S_{out,i}N_{in,i}} = \frac{(G_{1}\cdots G_{i-1})S_{in}N_{out,i}}{(G_{1}\cdots G_{i})S_{in}N_{in,i}} = \frac{N_{out,i}}{G_{i}N_{in,i}} \tag{2.69}
$$

Then we apply (2.66) to the ith stage and we have

$$
N_{out,i} = N_{i} + G_{i}N_{in,i} \tag{2.70}
$$


But what is $N_{in,i}$? This, being the input noise of the ith stage, comes from the input
resistance of the ith stage. Since we assume that every stage has the same source
resistance, every stage has the same input noise. For ease of reference, we
label all of them by the input noise of stage 1, which from Figure 2.12 is simply
given by $N_{in}$. Therefore, $N_{in,i} = N_{in}$. Substituting this in (2.70), we have

$$
N_{out,i} = N_{i} + G_{i}N_{in} \tag{2.71}
$$


Now we can substitute (2.71) in (2.69) and we have

$$
NF_{i} = \frac{N_{i} + G_{i}N_{in}}{G_{i}N_{in}} = 1 + \frac{N_{i}}{G_{i}N_{in}} \tag{2.72}
$$

Finally, since we are interested in relating $N_i$ to $NF_i$, we can rearrange (2.72) as

$$
N_{i} = (NF_{i} - 1)G_{i}N_{in} \tag{2.73}
$$

3. We want to find the output noise power of this cascade of stages. We will use (2.66)
and iteratively apply this formula, starting from the first stage and then to a
combination of the first and second stages, until we include all k stages. The resulting
output noise is the output noise from the kth stage, $N_{out,k}$. From Figure 2.12 we
know that $N_{out,k}$ is also denoted as $N_{out}$. The final expression therefore is given as

$$
N_{out} = (G_{1}G_{2}\cdots G_{k})N_{in} + N_{1}(G_{2}\cdots G_{k}) + N_{2}(G_{3}\cdots G_{k}) + \cdots + N_{k-1}G_{k} + N_{k} \tag{2.74}
$$

We then substitute (2.73) from step 2 in (2.74) and we get

$$
N_{out} = (G_{1}G_{2}\cdots G_{k})N_{in} + (NF_{1} - 1)(G_{1}G_{2}\cdots G_{k})N_{in} + \cdots + (NF_{k} - 1)G_{k}N_{in} \tag{2.75}
$$

4. Finally we want to derive the total noise figure of the cascaded chain. To do this
we substitute (2.68) from step 1 and (2.75) from step 3 into (2.63). Canceling
$S_{in}$ and $N_{in}$ and simplifying, we get

$$
NF = NF_{1} + \frac{NF_{2} - 1}{G_{1}} + \frac{NF_{3} - 1}{G_{1}G_{2}} + \cdots + \frac{NF_{k} - 1}{G_{1}G_{2}\cdots G_{k-1}} \tag{2.76}
$$

So we have finally derived the equation that relates the total noise figure to the
individual noise figures. This equation is called the Friis formula. Note that the NF
used in the Friis formula is specified as a ratio and not in decibels. Equation (2.76)
predicts that NF is dominated by the noise figure of the first stage, $NF_1$, provided
that the gain of the first stage is large. When compared with
(2.48) we see that both noise and distortion are dominated by one stage. The only
difference is that for noise this is the first stage whereas for distortion this is the
last stage. Of course, a front end with a small NF and a large IIP3 is desirable.
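
The Friis formula (2.76) can be sketched in a few lines. The example chain below (an LNA, a mixer, and an IF amplifier with made-up gains and noise figures) and the function name `friis_nf_db` are assumptions for illustration only. Inputs are in dB and are converted to ratios internally, since (2.76) is written in terms of ratios.

```python
import math


def friis_nf_db(gains_db, nfs_db):
    """Total noise figure in dB of a cascade, following (2.76).

    gains_db[i] and nfs_db[i] are the available power gain and noise
    figure (both in dB) of stage i+1.
    """
    total = 0.0
    gain_before = 1.0  # product of the linear gains preceding the stage
    for index, (g_db, nf_db) in enumerate(zip(gains_db, nfs_db)):
        factor = 10.0 ** (nf_db / 10.0)
        total += factor if index == 0 else (factor - 1.0) / gain_before
        gain_before *= 10.0 ** (g_db / 10.0)
    return 10.0 * math.log10(total)


# Hypothetical LNA / mixer / IF amplifier chain (illustrative values).
gains = [15.0, 10.0, 20.0]
nfs = [2.0, 10.0, 8.0]
print(f"Cascade NF = {friis_nf_db(gains, nfs):.2f} dB")
```

With these assumed values the total stays within about 1 dB of the first stage's noise figure even though the mixer is far noisier, which illustrates the first-stage dominance noted above.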


2.5     Derivation  of  NF,  IIP3  of  Receiver  Front  End 

We have now finished discussing the NF and IIP3 of the receiver front end. We have
also developed formulas that relate the IIP3 and NF of this receiver front end to
those of the individual subcomponents. Our next step is to translate the boundary
conditions on this front end, imposed by wireless standards and derived in Chapter 1,
to its key design parameters. We assume that the key design parameters are
IIP3, NF, and G. Boundary conditions such as SNRdemod_in and Pmin (communication
terminology) that we derived in Chapter 1 are now translated into the required IIP3
and NF (circuit terminology) of the front end.

Fig. 2.13    Receiver block diagram showing the G, NF, IIP3 and SNR of the front end
(receiver front end, characterized by NFrec_front, IIP3,rec_front, and Grec_front, followed by the demodulator)

To illustrate this process we use DECT as an example standard and we use the
heterodyne architecture as the example architecture. We will have a chance to redo
this using the zero-IF (homodyne) architecture in Problem 2.9. The heterodyne
architecture is redrawn in Figure 2.13, where as in Figure 2.2 we separate it into two
parts: the receiver front end followed by the demodulator. The receiver front end
consists of the part between the antenna and the demodulator and contains the
LNA, mixer, IF amplifier, and so forth. It is characterized by having its own gain
Grec_front, noise figure NFrec_front, and third-order intercept point IIP3,rec_front.
This front end takes an input signal from the antenna with a signal-to-noise ratio
denoted as SNRrec_front_in, processes it, and generates a signal at its output with a
signal-to-noise ratio denoted as SNRrec_front_out. As stated previously, our goal in
this section is to find the required Grec_front, NFrec_front, and IIP3,rec_front of this front
end. To simplify notation, unless otherwise specified all parameters used in this
section are to be interpreted in terms of decibels.


2.5.1     Required NFrec_front

In this subsection, we want to find the required NFrec_front. Let us apply (2.63),
interpreted in decibels, to Figure 2.13. We have

$$
\mathrm{NF}_{\mathrm{rec\_front}} = \mathrm{SNR}_{\mathrm{rec\_front\_in}} - \mathrm{SNR}_{\mathrm{rec\_front\_out}} \tag{2.77}
$$

Step 1: Calculate SNRrec_front_out.

From Section 2.2, boundary condition (2), we know that

$$
\mathrm{SNR}_{\mathrm{rec\_front\_out}} = 25\ \mathrm{dB} \tag{2.78}
$$

Step 2: Calculate SNRrec_front_in

SNRrec_front_in should be calculated under the worst-case situation. The
worst case happens when the received signal from the antenna is at its minimum and
the noise is at its maximum. We have minimum signal power when the receiver is
farthest away from the base station. From Section 2.2, boundary condition (1), we
know that Pmin = -77 dBm. Hence

$$
S_{\mathrm{rec\_front\_in}} = -77\ \mathrm{dBm} \tag{2.79}
$$


At this point we want to calculate the AWGN in the channel. This noise is hard to
calculate. However, we know for sure that since the antenna has a 50 Ω resistive
load, there will be thermal noise coming from this load. We further assume that this
is the only AWGN in the channel, and we use (2.60) to calculate this noise.
Applying (2.60), interpreted in decibels, and integrating over the noise
bandwidth B, we have

$$
N_{\mathrm{rec\_front\_in}} = 10\log\overline{v^{2}} = 10\log(4kTR_{s}B) \tag{2.80}
$$


Here $R_s$ = 50 Ω. How do we calculate B? Referring to Figure 2.13, we note that the
receiver front end has three different filters. We take the one with the narrowest
bandwidth to define B. From subsection 2.3.3 we note that BPF3 has the narrowest
bandwidth, which is given by (2.25) as 2π x 1.728 Mrad/s or 1.728 MHz.
Substituting this value into (2.80) and working in decibels we have

$$
N_{\mathrm{rec\_front\_in}} = 10\log_{10}(4kTR_{s}) + 10\log_{10}(B/\mathrm{Hz}) = -174\ \mathrm{dBm} + 62\ \mathrm{dB} = -112\ \mathrm{dBm} \tag{2.81}
$$

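Now from the SNR definition in decibels,

$$
\mathrm{SNR}_{\mathrm{rec\_front\_in}} = S_{\mathrm{rec\_front\_in}} - N_{\mathrm{rec\_front\_in}} \tag{2.82}
$$

Substituting (2.79) and (2.81) in (2.82), we have

$$
\mathrm{SNR}_{\mathrm{rec\_front\_in}} = -77\ \mathrm{dBm} - (-112\ \mathrm{dBm}) = 35\ \mathrm{dB} \tag{2.83}
$$

Step 3: Calculate NFrec_front

Let us substitute (2.78) and (2.83) into (2.77). We get

$$
\mathrm{NF}_{\mathrm{rec\_front}} = 35\ \mathrm{dB} - 25\ \mathrm{dB} = 10\ \mathrm{dB} \tag{2.84}
$$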

This is the required NF of the front end to satisfy the DECT standard.

Fig. 2.14    Four users are using their cellular phones at the same time, where d is the
maximum range from user to the base station (dictated by standards; in our example this
equals 400 m); d0 is the free space distance; d2 is the minimum allowable distance
between user 1 and user 2, where user 2 uses the next channel; and d3 is the
minimum allowable distance between user 1 and user 3
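
The three-step noise figure budget in (2.77) to (2.84) can be reproduced with the short sketch below. The function names are hypothetical, and the noise density of -174 dBm/Hz is simply the value used in (2.81), taken here as a default parameter. The only difference from the hand calculation is that the bandwidth term is not rounded to 62 dB.

```python
import math


def noise_floor_dbm(bandwidth_hz, density_dbm_per_hz=-174.0):
    """Input noise power in dBm over the noise bandwidth, as in (2.81)."""
    return density_dbm_per_hz + 10.0 * math.log10(bandwidth_hz)


def required_nf_db(p_min_dbm, snr_out_db, bandwidth_hz):
    """Required front-end NF in dB, combining (2.82) and (2.77)."""
    snr_in_db = p_min_dbm - noise_floor_dbm(bandwidth_hz)
    return snr_in_db - snr_out_db


# DECT numbers quoted in the text: Pmin = -77 dBm, 25 dB output SNR,
# 1.728 MHz noise bandwidth.
nf = required_nf_db(p_min_dbm=-77.0, snr_out_db=25.0, bandwidth_hz=1.728e6)
print(f"Required NF = {nf:.2f} dB")
```

Without rounding, the result is a few tenths of a decibel below the 10 dB obtained in (2.84), so the rounded hand calculation is slightly optimistic about how much noise the front end may add.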


2.5.2     Required IIP3,rec_front


In this subsection we want to derive the required IIP3,rec_front. The larger this
IIP3, the smaller is the third-order intermodulation product generated by the
interferers. We denote this intermodulation product, calculated at the output of
the receiver front end, as ID3''. These interferers, as mentioned in subsection 2.4.1,
have the same characteristics as narrowband noise. The maximum ID3'' must
be made small enough (achieved with a large enough IIP3,rec_front) that its power
is below the minimum Srec_front_out (minimum signal power at the output of the
receiver front end) by a sufficiently large margin.

Step 1: State the minimum desired signal power and the maximum interferer power
at the input of the front end.

To calculate the required IIP3, we must look at the condition when we have the
minimum Srec_front_out and the maximum ID3''. This corresponds to the condition
where, at the input of the front end, the desired signal power is at a minimum while
the interferers are at their maximum.

From Section 2.2, boundary condition (1), the minimum desired signal power at
the input of the front end is given as -77 dBm. To find the power of the interferers,
we assume that the interference is adjacent channel interference. To
understand adjacent channel interference, let us refer to Figure 2.14. Figure 2.14 shows
three users, where user 1 uses the desired channel. Let us assume he is assigned the
channel at 1.89 GHz. Next we assign the next two channels to users 2 and 3. Now let
us consider user 3. User 3 is transmitting a signal and interfering with user 1. From
Figure 2.14 user 3 is closer to user 1 than user 2 is, and hence the interfering signal
from him is larger. Hence he is placed farther away in frequency. Specifically, user
3 is assigned a channel at frequency = 1.89 GHz + 2 x 1.728 MHz = 1.8934 GHz.
User 2 is then assigned a channel at frequency 1.8917 GHz. The maximum power
from user 2 and user 3 is given in the DECT blocking specifications. This is shown
in Figure 2.15. For example, for a channel away it is around -62 dBm [3]. Since we
are interested in IIP3, we need to show only the two adjacent channels.
Hence Figure 2.15 summarizes the minimum desired signal power and the
maximum interferer power at the input of the front end that is required to calculate
IIP3,rec_front.

Fig. 2.15    The signal strength requirements of adjacent channels in the DECT standard
(signal strength [dBm] versus frequency [GHz]: user 1, desired channel, 1.89 GHz, -77 dBm (min);
user 2, 1.8917 GHz, -62 dBm (max); user 3, 1.8934 GHz, -43 dBm (max))

Fig. 2.16    Block diagram showing signal and interference level at input and output of receiver
front end

Step 2: Calculate the minimum Srec_front_out

We now redraw Figure 2.13, but we specify the relevant power levels at the input
and output of the front end. The resulting diagram is shown in Figure 2.16.

Fig. 2.17    Frequency spectrum at the output of the receiver front end (horizontal axis: frequency [MHz])

In Figure 2.16, at the input of the receiver front end the received signal is assumed
to consist of the desired signal and two interferers. Their powers are specified as

Minimum power of desired signal at 1.89 GHz = -77 dBm

Maximum power of interference from user 2 at 1.8917 GHz = -62 dBm

Maximum power of interference from user 3 at 1.8934 GHz = -43 dBm.

Next we determine the minimum Srec_front_out. Ignoring the nonlinearity of the
receiver front end for a moment, we see that the desired signal at 1.89
GHz and the interferers at 1.8917 GHz and 1.8934 GHz are all mixed down to the IF. In
(2.23) we chose the IF to be 100 MHz. Hence the three signals will be mixed down to
100 MHz, 101.7 MHz, and 103.4 MHz, respectively. Furthermore, from (2.24) and
(2.25) we know that BPF3 has a center frequency of 100 MHz and a bandwidth of
1.7 MHz. Hence the interferers at 101.7 MHz and 103.4 MHz will be filtered out.
The only signal at the front end output will be the one at 100 MHz. Since the
receiver front end has a gain of Grec_front, the signal at 100 MHz will have a power
level given by -77 dBm + Grec_front. Therefore we have

$$
\text{Minimum } S_{\mathrm{rec\_front\_out}} = -77\ \mathrm{dBm} + G_{\mathrm{rec\_front}}
$$


This  is  shown  in  Figure  2.16.  The  frequency  spectrum  is  shown  in  Figure  2.17. 

Step 3: Derive the maximum ID3'' in terms of the maximum power of interferers at
the receiver front end input, Pi, and IIP3,rec_front.
This step is subdivided into three substeps.

Step 3a: Derive the equivalent block diagram for the receiver front end and find the
frequency spectrum for ID3''.

In step 2 we have ignored the nonlinearity of the front end. Let us bring back this
nonlinearity and see what happens. First let us redraw Figure 2.16 in Figure 2.18(a).

Fig. 2.18   Two equivalent representations of the front end: (a) Interferers represented as
narrowband noise, (b) Interferers represented as tones

Here we subdivide the receiver front end into two blocks: the LNA_mixer block and the
BPF3_IF block. Each block has its own gain and IIP3. Their gain relationship with
Grec_front is given as

$$
G_{\mathrm{rec\_front}} = G_{\mathrm{LNA\_mixer}} + G_{\mathrm{BPF3\_IF}} \tag{2.85}
$$


We now concentrate on the LNA_mixer block, which consists of BPF1, LNA,
BPF2, and the mixer. The nonlinearity in the LNA_mixer block, as
characterized by IIP3,LNA_mixer, will take the two interferers from user 2 and user 3
(whose frequencies are at 1.8917 GHz and 1.8934 GHz, respectively, and at
maximum power) and generate a third-order intermodulation product, denoted as ID3
(whose frequency is at 1.89 GHz and at maximum power). ID3 will then be mixed
down by the mixer to an IF of 100 MHz. The mixed-down ID3 is denoted as ID3'.
Now ID3', being at 100 MHz, will not be filtered out by BPF3. It will pass through the
filter, be amplified by the IF amplifier, and generate ID3'' (at maximum
power). To save notation, we will also use ID3, ID3', and ID3'' to denote the power
levels of the respective intermodulation products. Which meaning they refer to
should be clear from the context. Let us further assume that BPF3 and the IF amplifier
are very linear. Accordingly, IIP3,BPF3_IF is practically given as

$$
\mathrm{IIP}_{3,\mathrm{BPF3\_IF}} = \infty \tag{2.86a}
$$

which means that IIP3,rec_front is given by

$$
\mathrm{IIP}_{3,\mathrm{rec\_front}} = \mathrm{IIP}_{3,\mathrm{LNA\_mixer}} \tag{2.86b}
$$

Hence when we apply ID3' to the BPF3_IF block no distortion occurs and no new
frequency components are generated. Consequently, ID3'' will also have only one
frequency component at 100 MHz. This is shown in the frequency plot in Figure
2.17. Likewise this maximum ID3'' is also shown in Figure 2.16. [We will show in
Problem 2.4(c) that even if the approximation given in (2.86a) is not observed, the
maximum ID3'' derived in step 3 remains practically the same.]
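
The frequency bookkeeping in step 3a can be verified with a few lines of arithmetic. The sketch below uses the channel frequencies and IF quoted in the text; the helper names are hypothetical, and `to_if` assumes that the mixer places the desired channel at the IF and preserves frequency offsets, which is an assumption of this example rather than a statement about the actual LO plan.

```python
F_DESIRED_HZ = 1.89e9         # user 1, desired channel
CHANNEL_SPACING_HZ = 1.728e6  # DECT channel spacing
IF_HZ = 100e6                 # IF chosen in (2.23)


def third_order_products(f1_hz, f2_hz):
    """Return the two close-in third-order products, 2*f1 - f2 and 2*f2 - f1."""
    return 2.0 * f1_hz - f2_hz, 2.0 * f2_hz - f1_hz


def to_if(f_rf_hz):
    """Map an RF frequency to the IF (assumes offsets are preserved by mixing)."""
    return IF_HZ + (f_rf_hz - F_DESIRED_HZ)


f_user2 = F_DESIRED_HZ + CHANNEL_SPACING_HZ
f_user3 = F_DESIRED_HZ + 2.0 * CHANNEL_SPACING_HZ
im3_low, im3_high = third_order_products(f_user2, f_user3)

print(f"Lower IM3 product: {im3_low / 1e9:.6f} GHz")
print(f"After mixing: {to_if(im3_low) / 1e6:.3f} MHz")
print(f"User 2 after mixing: {to_if(f_user2) / 1e6:.3f} MHz")
```

The lower product, 2 x f_user2 - f_user3, falls exactly on the desired channel, which is why it cannot be removed by BPF3 once it has been mixed down.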

Step 3b: Find $P_i$.

We know from step 3a that the maximum interferences from user 2 and user 3 at the
front end input are intermodulated and mixed to generate the maximum $I_{D3}''$ at the
front end output (at 100 MHz). In subsection 2.4.1.2, we represented each interferer,
which is like narrowband noise, by a tone (with the same power) in order to simplify
the IM3 calculation and subsequently the IIP3 calculation. We will use the same
representation here. Hence the interferer from user 2 at the front end input (at 1.8917
GHz and with a power of -62 dBm) in Figure 2.18(a) is now represented by a tone at
the same frequency (1.8917 GHz) and with the same power (-62 dBm) in Figure 2.18(b).
Similarly, the interferer from user 3 at the front end input (at 1.8934 GHz and
with a power of -43 dBm) in Figure 2.18(a) is now represented by a tone at the same
frequency (1.8934 GHz) and with the same power (-43 dBm) in Figure 2.18(b).
To further simplify the derivation, instead of having two different power levels for
the two tones, we assign one power level, taken to be the average of the two, to both
tones. We arbitrarily decide to take the geometric average of the two power levels
(equivalently, the arithmetic mean of the two values in dBm), which equals -52.5 dBm,
and assign it to both tones. Hence we have

$$P_i = -52.5\ \text{dBm} \tag{2.87}$$
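
As a quick numerical check of (2.87), the following sketch converts the two interferer powers to milliwatts, takes their geometric mean, and converts the result back to dBm. The helper names are illustrative only and do not come from the text.

```python
import math

def dbm_to_mw(p_dbm):
    """Convert a power in dBm to milliwatts."""
    return 10 ** (p_dbm / 10)

def mw_to_dbm(p_mw):
    """Convert a power in milliwatts to dBm."""
    return 10 * math.log10(p_mw)

def geometric_mean_dbm(p1_dbm, p2_dbm):
    """Geometric mean of two powers (taken in linear units), returned in dBm."""
    return mw_to_dbm(math.sqrt(dbm_to_mw(p1_dbm) * dbm_to_mw(p2_dbm)))

# Interferer powers from user 2 and user 3 at the front end input
p_i = geometric_mean_dbm(-62.0, -43.0)
print(f"P_i = {p_i:.1f} dBm")
```

Because the logarithm turns a product into a sum, the geometric mean in linear units is simply the arithmetic mean of the two dBm values.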

Step 3c: Derive maximum $I_{D3}''$ in terms of $IIP_{3,rec\_front}$.

First we apply (2.38) to the LNA_mixer block of Figure 2.18(b) and we have

$$IIP_{3,LNA\_mixer} = P_i - \frac{IM_{3,LNA\_mixer}}{2} \tag{2.88}$$

Remember from (2.87) that $P_i = -52.5$ dBm. Rearranging (2.88) and substituting
this value for $P_i$, we have

$$IM_{3,LNA\_mixer} = 2(P_i - IIP_{3,LNA\_mixer}) = 2(-52.5\ \text{dBm} - IIP_{3,LNA\_mixer}) \tag{2.89}$$

However, from the definition as given in (2.33) (interpreted in decibels)

$$IM_{3,LNA\_mixer} = I_{D3}' - I_{D1}' \tag{2.90a}$$

To reiterate, $I_{D3}'$ and $I_{D1}'$ are the third-order intermodulation product and
fundamental component generated by the two interferers, which are mixed down
and appear at the output of the mixer. In the present situation the two interferers are
at the maximum power and so the intermodulation product and the fundamental
component are also at their maximum power level. Rewriting (2.90a) under this
situation, we have

$$IM_{3,LNA\_mixer} = \text{maximum } I_{D3}' - \text{maximum } I_{D1}' \tag{2.90b}$$

Rearranging (2.90b), we have

$$\text{maximum } I_{D3}' = \text{maximum } I_{D1}' + IM_{3,LNA\_mixer} \tag{2.91}$$

Since maximum $I_{D1}'$ is the fundamental component generated by the maximum
interferer with a power level of -52.5 dBm, it is given by

$$\text{maximum } I_{D1}' = -52.5\ \text{dBm} + G_{LNA\_mixer} \tag{2.92}$$

Now we can substitute (2.89) and (2.92) into (2.91), and we get

$$\text{maximum } I_{D3}' = -52.5\ \text{dBm} + G_{LNA\_mixer} + 2(-52.5\ \text{dBm} - IIP_{3,LNA\_mixer}) \tag{2.93}$$


In discussing (2.86a) we stated that BPF3 and the IF amplifier are practically
linear. When we apply maximum $I_{D3}'$ to the input of the BPF3_IF block, the
maximum $I_{D3}''$ generated will contain only one frequency component at 100
MHz, whose level is given by

$$\text{maximum } I_{D3}'' = \text{maximum } I_{D3}' + G_{BPF3\_IF} \tag{2.94}$$

Substituting (2.93) into (2.94), we get

$$\text{maximum } I_{D3}'' = -52.5\ \text{dBm} + G_{LNA\_mixer} + 2(-52.5\ \text{dBm} - IIP_{3,LNA\_mixer}) + G_{BPF3\_IF} \tag{2.95}$$

Applying (2.85) and (2.86b) to (2.95), we have

$$\text{maximum } I_{D3}'' = -52.5\ \text{dBm} + G_{rec\_front} + 2(-52.5\ \text{dBm} - IIP_{3,rec\_front})$$
$$= -3 \times 52.5\ \text{dBm} + G_{rec\_front} - 2 \times IIP_{3,rec\_front} \tag{2.96}$$

This gives the maximum $I_{D3}''$ in terms of $IIP_{3,rec\_front}$.

Step 4: Relate $SNR_{rec\_front\_out}$ to maximum $I_{D3}''$ and hence relate $SNR_{rec\_front\_out}$ to
$IIP_{3,rec\_front}$. From the required $SNR_{rec\_front\_out}$, calculate the required $IIP_{3,rec\_front}$.

Fig. 2.19    Signal strength at the output of the receiver front end


In this step we refer again to Figure 2.18(b). As shown at the output of the
receiver front end, we have $I_{D3}''$, a tone at 100 MHz. At this point we want to
express $I_{D3}''$ using a narrowband noise representation again. We denote this noise as
$n_{I_{D3}''}(t)$. We first assume that $n_{I_{D3}''}(t)$, like the AWGN channel noise, is also additive
and has a Gaussian distribution. The power of $n_{I_{D3}''}(t)$ is, of course, the same as $I_{D3}''$
and is still given by (2.96). We redraw Figure 2.17 as Figure 2.19, where the value
of maximum $I_{D3}''$ [as expressed in (2.96)] is explicitly shown.

If we assume that there is no other noise (e.g., no AWGN from the channel,
antenna, or circuit noise), then at the receiver front end output we have a signal
immersed in noise and the SNR is given by

$$SNR_{rec\_front\_out} = \frac{\text{minimum } S_{rec\_front\_out}}{\text{maximum } I_{D3}''} \tag{2.97}$$

Substituting the appropriate values for minimum $S_{rec\_front\_out}$ and maximum $I_{D3}''$
from Figure 2.19 into (2.97) and simplifying (in decibels), we have

$$SNR_{rec\_front\_out} = 80.5\ \text{dBm} + 2 \times IIP_{3,rec\_front} \tag{2.98}$$

This signal and noise are applied to the demodulator and hence we have

$$SNR_{demod\_in} = SNR_{rec\_front\_out} = 80.5\ \text{dBm} + 2 \times IIP_{3,rec\_front} \tag{2.99}$$

Now what is the required $SNR_{demod\_in}$? To derive the required $SNR_{demod\_in}$ with
this noise, $n_{I_{D3}''}(t)$, we can replace $n(t)$ in (6.14), Chapter 1 by $n_{I_{D3}''}(t)$, so that (6.14)
now becomes

$$x_r(t) = a(t)\,s_i(t)\exp(-j\theta(t)) + n_{I_{D3}''}(t) \tag{2.100}$$

This equation is not to be interpreted in decibels. We can then start from (2.100)
and go through the rest of the derivation in subsection 6.2.2, Chapter 1, and derive
the required $SNR_{demod\_in}$ for DECT. Since the noise characteristics are assumed to
be the same as the AWGN in the channel, the required $SNR_{demod\_in}$ for DECT
should stay the same, which is given in (1.51), Chapter 1, as 25 dB. Hence if we
substitute 25 dB for $SNR_{rec\_front\_out}$ in (2.99) we get

$$25\ \text{dB} = 80.5\ \text{dBm} + 2 \times IIP_{3,rec\_front} \tag{2.101}$$

Solving, we get

$$IIP_{3,rec\_front} = -27.75\ \text{dBm} \tag{2.102}$$

This  is  the  required  IIP3  of  the  front  end  to  satisfy  the  DECT  standard. 
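
The chain from (2.96) to (2.102) can be retraced numerically. The sketch below is only an illustration: the function names are invented here, and the minimum input signal of -77 dBm is an assumption inferred from the 80.5 dBm constant in (2.98), since 80.5 = -77 + 3 x 52.5.

```python
def max_id3_out_dbm(p_i_dbm, g_rec_front_db, iip3_rec_front_dbm):
    """Maximum I_D3'' at the front end output, following (2.96)."""
    return 3 * p_i_dbm + g_rec_front_db - 2 * iip3_rec_front_dbm

def snr_out_db(s_min_in_dbm, p_i_dbm, g_rec_front_db, iip3_rec_front_dbm):
    """SNR at the front end output when I_D3'' is the only noise, as in (2.97)."""
    s_out_dbm = s_min_in_dbm + g_rec_front_db
    return s_out_dbm - max_id3_out_dbm(p_i_dbm, g_rec_front_db, iip3_rec_front_dbm)

def required_iip3_dbm(snr_req_db, s_min_in_dbm, p_i_dbm):
    """Solve SNR = S_min - 3*P_i + 2*IIP3 for IIP3 (the gain cancels out)."""
    return (snr_req_db - s_min_in_dbm + 3 * p_i_dbm) / 2

P_I_DBM = -52.5       # from (2.87)
S_MIN_IN_DBM = -77.0  # assumed minimum input signal, inferred from (2.98)
SNR_REQ_DB = 25.0     # required SNR at the demodulator input for DECT

iip3 = required_iip3_dbm(SNR_REQ_DB, S_MIN_IN_DBM, P_I_DBM)
print(f"required IIP3 = {iip3:.2f} dBm")
```

Note that the front end gain appears in both the signal and the intermodulation product, so it drops out of the SNR and the result does not depend on it.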


2.6     Partitioning of required NFrec_front and IIP3,rec_front
into individual NF, IIP3

Our strategy here is to start with a set of power gains, NF, and IIP3 of the individual
stages, based on some existing receiver front end. This will provide us with an
initial design. We then calculate the $NF_{rec\_front}$ and $IIP_{3,front\_end}$ of this existing front
end and see if it satisfies the required $NF_{rec\_front}$ and $IIP_{3,front\_end}$. Iterations can be
carried out if necessary. In subsequent chapters, we go through the actual design of
these stages and figure out if the power gains, NF, and IIP3 of the individual stages are
achievable. Further iterations can then be carried out if necessary.

Before we carry out this strategy, there is one more point to be noted. Up to now we
have assumed that the power gain G is given as the square of the voltage gain.
However, this is only true if the termination resistances are the same. In general, $G_i$ is
given by

$$G_i = A_{vi}^2\,\frac{R_{in,i}}{R_{out,i}} \tag{2.103}$$

where $G_i$, $A_{vi}$, $R_{in,i}$, $R_{out,i}$ are the power gain, voltage gain, and input and output
termination resistance of the $i$th stage, respectively [4]. Therefore, instead of
specifying G, we would specify the termination resistances and the voltage gain
of the individual stages and then calculate the corresponding G.
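
A small sketch of this conversion, working in decibels, is given below. It assumes the power gain of a stage is its squared voltage gain scaled by the ratio of input to output termination resistance; the function name and the example values are hypothetical and are not taken from Table 2.1.

```python
import math

def power_gain_db(av_db, r_in, r_out):
    """Power gain in dB from voltage gain in dB and termination resistances.

    G = Av^2 * R_in / R_out, so in decibels
    G_dB = Av_dB + 10*log10(R_in / R_out), where Av_dB = 20*log10(Av).
    """
    return av_db + 10 * math.log10(r_in / r_out)

# Equal terminations: power gain equals voltage gain (both in dB)
print(power_gain_db(10.0, 50.0, 50.0))

# Hypothetical stage with a 50 ohm input and a 1 kohm output termination
print(power_gain_db(10.0, 50.0, 1000.0))
```

The second call shows why the distinction matters: a stage that steps the impedance level up from 50 ohm to 1 kohm has a power gain about 13 dB lower than its voltage gain.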

Step 1: Specify the voltage gain and termination resistance of the subcomponents
The heterodyne architecture is redrawn in Figure 2.20 with the corresponding
termination resistances. BPF1 and BPF2 both need 50 Ω termination resistance;
otherwise the filters would lose their frequency responses. For this example we
choose a BPF3 that has $R_{in}$ = 1 kΩ and $R_{out}$ = 1 kΩ. The LNA is specified to have
input and output resistances of 50 Ω. The output resistance of the mixer is much
higher than its input resistance. In this case it is a nice feature since it maximizes the
voltage gain of the mixer. Thus the mixer is specified with an input resistance of 50 Ω
and an output resistance of 1 kΩ. Finally, the input resistance of the demodulation
block is specified to be 1.2 kΩ.

Fig. 2.20    a) The heterodyne architecture b) The heterodyne architecture with the corresponding
input and output impedance of the different subcomponents

The voltage gains of the various subcomponents are given in the second row of
Table 2.1. The power conversion gains can now be calculated using equation
(2.103) and are given in the third row of Table 2.1. The overall conversion gain G
can be calculated by summing all the terms in row 3 and we have

$$G = 31.6\ \text{dB} \tag{2.104}$$

Step 2: Specify a possible set of NF of the subcomponents that meets the required
$NF_{front\_end}$.

First, we need to specify the noise figure of the individual subcomponents.
An initial set of values is given in row 4 of Table 2.1. The general philosophy is
for the early stages of the front end (basically the LNA) to dominate the NF (i.e., the
early stages should have good NF [small values] and the later stages can afford to
have poorer NF [larger values]). The exact values (i.e., 3 dB for LNA and 12 dB for
mixer) depend on the circuit and technology details and will be covered in
later chapters.

Substituting the G and NF values from rows 3 and 4 of Table 2.1 into Friis'
formula (2.76) and carrying out the calculation in ratio (not in decibels), we have the
noise contributions calculated for individual components, which are shown in row
5 of Table 2.1.

Finally, we get the total noise figure by adding the corresponding components of
the Friis formula.

$$NF = 11.54 \text{ or } 10.6\ \text{dB} \tag{2.105}$$
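
The procedure of step 2 can be sketched as follows. This is a minimal illustration of the standard Friis cascade formula, computed in ratio as the text requires; the two-stage chain in the example is hypothetical (only the 3 dB LNA and 12 dB mixer noise figures are quoted in the text, and the gains are assumed), so it does not reproduce (2.105).

```python
import math

def db_to_ratio(x_db):
    """Convert a power quantity in dB to a linear ratio."""
    return 10 ** (x_db / 10)

def friis_noise_factor(stages):
    """Cascade noise factor from a list of (power_gain_db, nf_db) tuples.

    Stages are ordered from the antenna onward. Returns the total noise
    factor (ratio) and the list of per-stage contributions (ratio).
    """
    total = 0.0
    gain_so_far = 1.0  # product of the power gains of all preceding stages
    terms = []
    for index, (gain_db, nf_db) in enumerate(stages):
        f = db_to_ratio(nf_db)
        # First stage contributes F1; later stages contribute (F - 1) / G_preceding
        term = f if index == 0 else (f - 1.0) / gain_so_far
        terms.append(term)
        total += term
        gain_so_far *= db_to_ratio(gain_db)
    return total, terms

# Hypothetical chain: LNA (15 dB gain assumed, NF 3 dB), mixer (10 dB gain assumed, NF 12 dB)
total, terms = friis_noise_factor([(15.0, 3.0), (10.0, 12.0)])
print(f"NF = {total:.2f} (ratio) or {10 * math.log10(total):.2f} dB")
```

The per-stage terms play the role of row 5 of Table 2.1: each later stage's contribution is divided by the gain ahead of it, which is why the early stages dominate.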


Table 2.1    Characteristics of the subcomponents used in the receiver of Figure 2.20

[Table body lost in extraction. Legible entries: row 6 (IIP3, in dBm): 90, 90, 90, 20;
row 7 (components of the overall IIP3 formula (2.48)): (0.15 × 10^-3)^-1, ignored,
(0.01 × 10^-3)^-1, ignored, ignored.]

Comparing this value and the required $NF_{rec\_front}$ [calculated in (2.84) to be 10
dB], we see that we have selected a set of G, NF values for the subcomponents that
allows the front end to meet the required NF.

Step 3: Specify a possible set of IIP3 of the subcomponents that meets the required
$IIP_{3,front\_end}$.

As in the NF case, we need to specify the IIP3 of the individual subcomponents. The
values are given in row 6 of Table 2.1. The general philosophy is for the later stages
of the receiver (basically the mixer) to dominate the distortion. The exact values
(i.e., -10 dBm for LNA and -10 dBm for mixer) again depend on the circuit and
technology details and will be covered in later chapters.

Substituting G from row 3 and IIP3 from row 6 into the overall IIP3 formula,
(2.48), and carrying out the calculation in ratio (not in decibels), we have the IIP3
contributions calculated for individual components, which are shown in row 7 of
Table 2.1. Finally, we get the IIP3 by adding the corresponding components in row
7 and then taking the reciprocal.

$$IIP_{3,rec\_front} = \left\{10\log_{10}\left(\frac{1}{(0.15\times10^{-3})^{-1} + (0.01\times10^{-3})^{-1}}\right) + 30\right\}\ \text{dBm} \approx -20\ \text{dBm} \tag{2.106}$$

Comparing this value and the required $IIP_{3,rec\_front}$ [calculated in (2.102) to be
-27.75 dBm], we see that we have selected a set of G, IIP3 values for the
subcomponents that allows the front end to meet the required $IIP_{3,front\_end}$.
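
The arithmetic of step 3 can be checked with a few lines. The sketch below takes the two non-ignored row 7 components quoted in (2.106), 0.15 x 10^-3 W and 0.01 x 10^-3 W, sums their reciprocals, inverts the sum, and converts watts to dBm. The function name is illustrative.

```python
import math

def overall_iip3_dbm(components_w):
    """Overall IIP3 in dBm from input-referred per-stage IIP3 components in watts.

    The reciprocals of the components add; the overall IIP3 is the reciprocal
    of that sum. Adding 30 converts dBW to dBm.
    """
    inverse_sum = sum(1.0 / p for p in components_w)
    return 10 * math.log10(1.0 / inverse_sum) + 30

iip3_front = overall_iip3_dbm([0.15e-3, 0.01e-3])
print(f"IIP3 = {iip3_front:.2f} dBm")
```

The smaller component (0.01 x 10^-3 W) dominates the sum of reciprocals, which is the numerical counterpart of the statement that one stage dominates the distortion.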



2.7     Problems 

2.1  For DECT's standard, for worst-case reception, calculate the SNR at the input
of the demodulator (Figure 2.1). We assume no front end is used and the only
AWGN in the channel comes from a 50 Ω input resistance.

2.2  In the chapter we discussed that we want to go for a fixed $f_{if}$ (and hence a
variable $f_{lo}$) scheme because this allows the use of a fixed frequency BPF2,
which is easier to implement. We showed how, with this scheme, image can
become a problem. A system designer suggests that we should instead use a
variable $f_{if}$, fixed $f_{lo}$ scheme, together with a fixed frequency BPF2 because he
believes this will fix the problem. Is he correct? To answer this you can follow
parts (a), (b), and (c). Assume that the receive band spans from 824 to 894 MHz.

(a)  Assume that we use a fixed $f_{if}$ of 10 MHz and a variable $f_{lo}$. Draw the
frequency spectrum, including all relevant frequencies, when $f_{rf}$ is 894 and
824 MHz. Repeat the case for $f_{if}$ of 100 MHz.

(b)  Now assume that we use a variable $f_{if}$ and a fixed $f_{lo}$ of 760 MHz. Again,
draw the spectrum when $f_{rf}$ is 894 and 824 MHz. Repeat the case for $f_{lo}$
of 850 MHz.

(c)  Now determine the $f_{image}$ for all the cases in (a) and (b) and comment on
whether scheme (b) is better than scheme (a) as far as making it easier for
BPF2 to filter out the image.

2.3  This  problem  concerns  the  qualitative  understanding  of  the  nonlinear  behaviour 
of  the  receiver  front  end. 

(a)  We  mentioned  that  harmonic  distortion  is  not  an  issue  for  the  heterodyne 
architecture  described  in  Figure  2.2  since  the  bandpass  filters  (BPF1,  2,  3) 
will  filter  them  out.  What  happens  if  they  do  not  filter  them  out  completely? 

(b)  We have shown mathematically what blocking is. Explain, from first
principles and in words (no equations), the mechanism of blocking. Offer
an explanation in words (no equations) that distinguishes how blocking and
intermodulation of interferers affect $SNR_{rec\_front\_out}$ differently.

(c)  Is  it  possible  to  have  a  receiver  front  end  such  that  it  generates  poor  IM3  but 
does  not  block? 

2.4  This  question  clarifies  subtleties  encountered  in  Section  2.5. 

(a)  At  the  beginning  of  Section  2.5,  we  assumed  that  IIP3  is  the  only  key  design 
parameter  as  far  as  characterizing  the  receiver  front  end's  distortion 
performance.  Comment  on  the  validity  of  this  assumption. 

(b)  In (2.81) we used 1.728 MHz (= bandwidth of BPF3) as the noise
bandwidth B. Comment on the validity of this assertion.

(c)  In step 3 of subsection 2.5.2 we said that BPF3's and IF amplifier's
nonlinearity does not matter [refer to (2.86a)]. Justify this.

2.5  In Section 2.6 we stated that power gain G is the same as the voltage gain
squared $A_v^2$ only if $R_L = R_S$; otherwise it is given by (2.103). Derive (2.103) and
show that it is only true if the input of the $i$th stage is matched to the output of
the previous stage [the $(i-1)$th stage].

2.6  Calculate  the  individual  components  of  the  Friis  formula  in  Table  2.1. 

2.7  For  the  heterodyne  architecture  whose  subcomponents  are  described  in  Table 
2.1,  plot  the  change  of  the  overall  NF  as  a  function  of  the  isolated  NF  of  the 
LNA  with  a  fixed  voltage  gain  of  12  dB  and  as  a  function  of  voltage  gain  of  the 
LNA  with  a  fixed  NF  of  3  dB. 

2.8  Let  us  reexamine  the  architecture  described  in  Figure  2.20  and  Table  2.1  as 
follows:  we  change  the  IF  amplifier's  IIP3  to  be  -20  dBm.  Now  the  mixer,  and 
the  IF  amplifier  have  IIP3  =  -10  dBm,  -20  dBm  respectively.  Hence  the 
IF  amplifier  is  poorer  than  the  mixer  in  terms  of  IIP3.  On  the  other  hand,  the 
OIP3 of the IF amplifier, even with this new IIP3 value, is 9.2 dBm. This
remains better than the OIP3 of the mixer (which is -13 dBm). Which
component is better? Explain.

2.9  In this problem we deal with a different standard and a different receiver front
end architecture. The standard is given as follows: the carrier frequency is 800
MHz and the channel bandwidth is 200 kHz. The input signal plus interference
spans from -104 dBm to -10 dBm. Assume that a BPSK modulation/demodulation
scheme is used and the required BER is $10^{-3}$. Furthermore, TDMA and
FDMA are used to separate the different users. To simplify matters, we will
neglect the impact of fading in our calculation.

Figure P.2(a) and Figure P.2(b) show the block diagram describing a
different receiver front end architecture (called zero IF or homodyne receiver
architecture), together with a diagram that specifies the input and output
impedance of each subcomponent (matched).

The receiver front end takes the signal, filters it with a BPF for anti-aliasing,
and amplifies it with an LNA. (The LNA is assumed to have an output matching
network that performs further noise-rejecting filtering. This output matching
network [noise rejection filter] is included in the LNA block and not explicitly
shown. The noise it is rejecting comes mainly from the LNA itself.) The signal
is then fed into an automatic gain control (AGC) circuit and then converted by the
A/D converter. The sample and hold circuit in the A/D converter (to be
discussed in Chapter 6) acts as a sampling mixer (to be discussed in Chapter
5). This sampling mixer mixes the signal to DC before it is quantized.
That is why this is called a zero IF architecture. It should be noted that the
mixing to baseband (zero IF) is inherently performed as the sampling operation
of the A/D converter. In this problem, subsampling (or bandpass sampling)
is performed. This means that the sampling frequency in the A/D converter is
much lower than the carrier frequency (800 MHz) and at least larger than the
Nyquist frequency of the baseband signal (2 x 200 kHz). As a side note, as a
result of using subsampling, noise from the channel and LNA at around
multiples of the subsampled frequency will be aliased and hence there is the
need for the BPF and noise rejection filter. Further demodulation and other signal
processing are done in a digital signal processor (DSP) at DC.

Fig. P.2(a)  Receiver architecture assumed in this example (antenna, A/D converter, and DSP
shown), where $A_i$ denotes the voltage gain, $NF_i$ denotes the isolated noise figure and
$IIP_{3,i}$ denotes the isolated third-order intercept point (power) at the input (i = 1, ..., 4)

Fig. P.2(b)    Input and output impedance ($Z_{in}$, $Z_{out}$) are characterized for each subcomponent
[Legible figure labels, left to right: 50 Ω, 50 Ω; 50 Ω, 1 kΩ; 1 kΩ, 1 kΩ.]

(a)  Find  the  NF  required  of  the  receiver  front  end. 

(b)  Find the NF of the A/D converter. An A/D converter is characterized by
another type of additive noise: quantization noise. As shown in Chapter 6,
the noise is white and the spectral density of noise is given by
$\overline{v_n^2}/\Delta f = \Delta^2/(12 f_b)$,
where $\Delta$ = step size of the least significant bit (LSB) and is given by $V_{FS}/2^n$.
Here $V_{FS}$ is the full-scale voltage and $n$ is the number of bits in the A/D
converter. $f_b$ is the bandwidth of the A/D converter. Suppose $V_{FS} = 3.13$ V,
$n = 12$. What should $f_b$ be? What is the gain of the A/D converter? What is
$\overline{v_n^2}/\Delta f$, and what is the NF of the A/D converter?


(c)  Assume that the following information for the individual stage is given:

$NF_1 = 2$ dB, $A_1 = -2$ dB
$A_2 = 20$ dB
$NF_3 = 10$ dB, $A_3 = 31$ dB

Calculate the noise figure for the LNA.
Chapter 3

Low  Noise  Amplifier 


3.1     Introduction 

Ostensibly, one key component of any receiver chain is the low-noise amplifier
(LNA) coming off the antennas. Since the signal at that point is comparatively
weak, good gain and noise performance are necessary. In Chapter 2, we showed that
the overall noise factor of the receiver front end is dominated by the first few stages
and can be approximated according to Friis' formula as

$$NF_{rec\_front} = \frac{1}{G_{LNA}}\left(NF_{subsequent} - 1\right) + NF_{LNA} \tag{3.1}$$

where $NF_{subsequent}$ is the total input-referred noise factor of the components
following the LNA, and $G_{LNA}$, $NF_{LNA}$ are the gain and noise factor of the LNA itself. The
noise of all subsequent stages is reduced by the gain of the LNA and the noise of the
LNA is injected directly into the received signal. Thus the LNA needs both high
gain and low noise. There are well-known trade-offs in amplifiers between noise
and gain. Often, one is achieved at the expense of the other. The inherent issues and
compromises between these two are examined in this chapter.
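
To make the trade-off concrete, the sketch below evaluates the approximation (3.1) for a few LNA gains. The function name and the numerical values (a 3 dB LNA noise figure and a 12 dB noise figure for everything after the LNA) are assumptions chosen only for illustration.

```python
import math

def nf_rec_front_db(nf_lna_db, g_lna_db, nf_subsequent_db):
    """Front end noise figure in dB from (3.1), evaluated in ratio."""
    f_lna = 10 ** (nf_lna_db / 10)
    g_lna = 10 ** (g_lna_db / 10)
    f_subsequent = 10 ** (nf_subsequent_db / 10)
    f_total = (f_subsequent - 1.0) / g_lna + f_lna
    return 10 * math.log10(f_total)

# Sweep the LNA gain with the noise figures held fixed (illustrative values)
for gain_db in (0.0, 10.0, 20.0, 30.0):
    nf = nf_rec_front_db(3.0, gain_db, 12.0)
    print(f"G_LNA = {gain_db:4.1f} dB -> NF_rec_front = {nf:.2f} dB")
```

As the LNA gain grows, the contribution of the subsequent stages shrinks and the overall noise figure approaches that of the LNA alone.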


3.1.1     General  Philosophy 

Low-noise amplifier design, and for that matter any high-frequency amplifier design,
can be approached from one of two methods: lumped parameter or distributed
parameter methods. In the lumped parameter methods, stability, gain, and noise
performances are analyzed using Bode/Nyquist plots. During the design, the active
devices and the input/output termination networks are treated separately as far as their
impact on stability, gain, and noise performance is concerned. Here voltage and
current are primary variables of interest and two-port representations such as y, z,
h or g parameter representations are adopted. The distributed parameter methodology,
on the other hand, starts to take into consideration the distributed nature of the circuits
and uses Smith charts [based on scattering (S) parameters]. Throughout the analysis
both the active devices and the input/output terminations are considered together in
determining the impact on circuit stability, gain, and noise performances. Here power
becomes the primary variable of interest and S-parameter based analysis becomes the
main tool [1]. For the same circuit the two methods will come to the same conclusion.

Fig. 3.1a    Generalized LNA topology

Fig. 3.1b    Lumped parameter representation of amplifier

The general topology of any LNA can be broken down into three stages: an
input-matching network, the amplifier itself, and an output-matching network
[Figure 3.1a]. In Figure 3.1a the LNA and matching network are characterised by
both lumped parameters (e.g., $R_{in}$, $R_{out}$, etc.) and S parameters (e.g., $S_{11}$, $S_{12}$, etc.).
Let us digress a bit here into some discussions on S parameters. There are four
S-parameters of interest: $S_{11}$, $S_{22}$, $S_{12}$, and $S_{21}$. We will try to draw the analogy
between them and their lumped parameter counterparts. If we redraw the amplifier
in Figure 3.1a using a lumped parameter representation as shown in Figure 3.1b, it
may help to think, in a very crude way, of the following analogy and relationship:

$S_{21}$ <-> the forward gain, $A_f$

$S_{12}$ <-> the reverse transmission (or leakage) factor, $A_r$, which is usually very small
at low frequency, but can become significant at high frequency

$S_{11}$ <-> the input impedance, $R_{in}$

$S_{22}$ <-> the output impedance, $R_{out}$

The primary motivation for using the S-parameter is that at microwave
frequencies the parameters in a lumped parameter representation are very difficult
to measure. This is because the short and open circuit conditions, upon which the
definitions of these representations are based, are difficult to implement over a
wide band at high frequencies. In addition, high-frequency transistors are prone
to oscillation under open or short circuit conditions. Consequently, this new
representation, the S-parameter, whose variables are travelling waves and the power
associated with them, is adopted. Having said that, our philosophy is to stay with
the lumped parameter representation as much as possible, as integrated circuit
designers may be more familiar with it. Only when we need to interface between
the transistor and the input/output networks will we revert to an S-parameter
representation. For example, in our design example later on, we will go
through the whole design process using lumped parameters. Then we will calculate
$S_{11}$, $S_{22}$ based on the calculated $R_{in}$, $R_{out}$ and the characteristic impedance $Z_0$
(typically 50 Ω). In this way the S-parameter method will be made as transparent as
possible.


3.1.2    Matching  Networks 

3.1.2.1     Objectives 

Returning  to  the  LNA,  let  us  continue  our  original  discussion  on  the  input/output 
matching  networks.  The  input  and  output  networks  are  passive,  consisting  of 
striplines,  inductors,  capacitors,  and  resistors.  These  networks  achieve,  among 
other  things,  the  following  objectives: 

1. They provide proper terminations to the analog filters between the antenna and
the LNA as well as between the LNA and the subsequent mixer. With proper
terminations, these filters' frequency responses are preserved. Sometimes the
matching networks themselves are used to perform part of the filtering.

2. The input matching network ensures optimum noise performance as well as
stability at the input. The output matching network ensures stability at the
output.

3.  The  input  matching  network  provides  the  proper  power  matching  between  the 
antenna  and  the  LNA. 

The  objectives  of  primary  interest  to  us  are  (2)  and  (3),  which  we  will  discuss  next. 


3.1.2.2     Matching  for  Noise  and  Stability 

There are two separate issues in objective (2): noise and stability. Let us start by
discussing the first one, the noise issue. We refer to Figure 3.1a and imagine that
initially we remove the input matching network. We further assume all the noise in
the amplifier is represented by the equivalent noise voltage, $\overline{v_i^2}$, and current, $\overline{i_i^2}$.
From [2] it is shown that to achieve optimum noise matching, the value of $R_s$ should be
made equal to the value of a fictitious resistance, called the optimum noise
resistance, $R_{Sn}$. If $\overline{v_i^2}$ and $\overline{i_i^2}$ are uncorrelated, the value of this $R_{Sn}$ is related to
the equivalent noise sources of the amplifier by [2]

$$R_{Sn} = \sqrt{\frac{\overline{v_i^2}}{\overline{i_i^2}}} \tag{3.2}$$

If $\overline{v_i^2}$ and $\overline{i_i^2}$ are correlated, then [3] gives a more complete expression for the value
of $R_{Sn}$. On the other hand, $R_s$ comes from the output resistance of BPF1 (or the
antenna resistance if BPF1 is not used), which will, in general, have a different value
than the value of $R_{Sn}$ (as given by (3.2) or derived in [3]). To make them equal, or
match them, we have to go back to Figure 3.1a and put in the input-matching
network. This input matching network should be so designed that $R'_{Sn}$, looking into
the matching network/amplifier combination, has the same value as $R_s$.
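
As an illustration of the optimum noise resistance for uncorrelated sources, the sketch below takes the square root of the ratio of the mean-square equivalent noise voltage to the mean-square equivalent noise current. The function name and the noise densities (1 nV/sqrt(Hz) and 2 pA/sqrt(Hz)) are hypothetical values, not data from the text.

```python
import math

def optimum_noise_resistance(v_sq, i_sq):
    """Optimum source resistance for uncorrelated equivalent noise sources.

    v_sq: mean-square equivalent input noise voltage density, in V^2/Hz
    i_sq: mean-square equivalent input noise current density, in A^2/Hz
    """
    return math.sqrt(v_sq / i_sq)

# Hypothetical amplifier: 1 nV/sqrt(Hz) and 2 pA/sqrt(Hz)
v_sq = (1e-9) ** 2
i_sq = (2e-12) ** 2
r_sn = optimum_noise_resistance(v_sq, i_sq)
print(f"R_Sn = {r_sn:.0f} ohm")
```

With these assumed densities the optimum is far from a 50 ohm source, which is the situation in which an input matching network is needed to transform one value into the other.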

Next we briefly look at the second issue of objective (2), the stability issue.
Let us refer to Figure 3.1a and imagine again that initially we remove the input and
output matching networks. From the definition in [1] we have

$$S_{11} = \frac{Z_0 - R_{in}}{Z_0 + R_{in}} \tag{3.2a}$$

$$S_{22} = \frac{Z_0 - R_{out}}{Z_0 + R_{out}} \tag{3.2b}$$

where $Z_0$ is the characteristic impedance of the transmission line that connects
$R_s$ and $R_{load}$ to the amplifier.

How does stability depend on these S-parameters, whose absolute value is
between 0 and 1? Let us look at $S_{11}$ first and let us assume, for example, that $S_{11}$
is large (close to 1). From the definition of S parameters [1], we have

$$|S_{11}|^2 = \left.\frac{\text{reflected power}}{\text{incident power}}\right|_{\text{input}}$$

Hence having $S_{11}$ close to 1 means that the reflected power is almost equal to the
incident power. This means that whatever power we deliver to the amplifier is
almost totally reflected. This reflected power, again, is going to be reflected by the
voltage source $V_s$ back to the amplifier. We can then qualitatively see that there will
be a large amount of power shuffling back and forth and that the amplifier is on the
verge of oscillation. On the other hand, if $S_{11}$ is small (close to 0), the reflection will
be small and the amplifier becomes more stable. Therefore, in principle $S_{11}$ should
be kept as small as possible as far as stability is concerned. Ideally, $S_{11}$ should be 0.
From (3.2a) this means $R_{in}$ should be kept close to $Z_0$. In cases when $R_{in}$ is very
different from $Z_0$ we have to go back to Figure 3.1a and put in the input-matching
network, which will then transform $R_{in}$ to the required value, denoted as $R'_{in\_stable}$.
Similar observations and conclusions go for the output matching network and $R_{out}$,
$S_{22}$, $R'_{out\_stable}$. When the input and output matching networks are put back in
Figure 3.1a, (3.2a) and (3.2b) will be modified and will become

$$S_{11} = \frac{Z_0 - R'_{in\_stable}}{Z_0 + R'_{in\_stable}} \tag{3.2c}$$

$$S_{22} = \frac{Z_0 - R'_{out\_stable}}{Z_0 + R'_{out\_stable}} \tag{3.2d}$$

3.1.2.3     Matching  for  Power 

We now elaborate on objective (3). Referring to Figure 3.1a, with proper power
matching, $V_s$ can deliver the maximum amount of power to the LNA. This concept
is called power matching and is discussed in some detail in this sub-section.

First Figure 3.1a is redrawn in Figure 3.2a without the matching network while
using Figure 3.1b to represent the amplifier. Hence $V_{out\_amp}$ is the same as $V_o$.
In anticipation that the amplifiers to be discussed later in the chapter are MOS
based, we label the input voltage of the amplifier $V_{in\_amp}$ as $V_{gs}$. First let us look at
the case when the amplifier of Figure 3.1b is assumed to be unilateral. In this case
we can set $A_r = 0$. Let us refer to Figure 3.2a and define our goal as the delivery of
a maximum amount of power to $R_{load}$, the load resistor. Obviously, to achieve that,
$V_o$ and hence $V_{gs}$ have to be at their maximum. This can be done by making $R_{in}$
infinite. Now let us look at the case when the amplifier of Figure 3.1b is not
unilateral. Hence the dependent current source given by $A_r V_{out\_amp}$ cannot be
neglected. In this case, to deliver the maximum amount of power to $R_{load}$ we need
(the detailed derivation is skipped here):

$$R_{in} = R_s \tag{3.3}$$


An alternative development that leads to the same conclusion is also available if
we represent the amplifier using S parameters. With S parameters, people tend to
work with power (rather than voltage) as the variable. Rather than talking about the power
delivered to Rload (PL) as being related to voltage through the expression

$$P_L = \frac{V_o^2}{R_{load}} \tag{3.4}$$

we relate this power through an expression involving GT, the transducer power gain.
This is defined in [1] as

$$G_T = \frac{P_L}{P_{AVS}} = \frac{\text{power delivered to } R_{load}}{\text{power available from source}} \tag{3.5}$$

Fig. 3.2a    LNA using lumped parameter representation


Since PAVS from the source is fixed (e.g., for DECT, the minimum value is min PAVS =
-11 dBm), from (3.5) maximum PL corresponds to maximum GT. From [1], maximum
GT is obtained when both input and output are matched. To achieve this at the input,
Rin is matched to Rs. Hence again,

$$R_{in} = R_s \tag{3.6}$$


If Rin does not have the same value as Rs, (3.3) or (3.6) can be satisfied by inserting
a matching network and transforming Rin to R'in_power, as depicted in Figure 3.1a.
This R'in_power should be set equal to Rs. Similar conclusions go for Rout and R'out_power.
One such example, where the input matching network is implemented using a transformer,
is shown in Figure 3.2b.
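The condition Rin = Rs can be checked numerically with the familiar maximum-power-transfer argument. The sketch below treats the amplifier input simply as a resistor Rin driven by a source Vs behind Rs; the function name and the candidate resistor values are illustrative assumptions, not taken from the text.

```python
def power_into_rin(v_s, r_s, r_in):
    """Power dissipated in r_in when driven by an rms source v_s behind r_s."""
    v_in = v_s * r_in / (r_s + r_in)   # resistive voltage divider
    return v_in ** 2 / r_in

R_S = 50.0                              # assumed source resistance, ohms
candidates = [5.0, 25.0, 50.0, 100.0, 500.0]

# Pick the candidate input resistance that absorbs the most power.
best = max(candidates, key=lambda r: power_into_rin(1.0, R_S, r))
for r in candidates:
    print(f"Rin = {r:6.1f} ohm -> P = {power_into_rin(1.0, R_S, r) * 1e3:.3f} mW")
print("best candidate:", best)
```

The maximum lands on Rin = Rs, where the delivered power equals the available power Vs^2/(4 Rs).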

Let us now refer to Figure 3.1a again and summarize what we have done so
far in sub-section 3.1.2. We will concentrate on the input matching network;
the conclusions essentially carry over to the output matching network. Basically,
we start with a given Rin, build a matching network, and transform Rin.
This transformation is used to achieve optimum noise performance, maintain
stability, or achieve matching for power. Since these are three separate goals
and we have only one matching network (hence one unique transformation),
a compromise has to be struck. However, as shown in [3], if we choose the objective
of maximum power transfer, we are still close to achieving the objective of
optimum noise performance.

Fig. 3.2b   LNA with transformer matching at the input


3.1.2.4     Implementation 

In Figure 3.2b, an ideal wideband matching element was used: a transformer.
Although it is possible to utilize transformers at RF, for higher turns ratios we run
into difficulty because the interwinding capacitance limits the frequency response
of the coils. A wideband match is preferably implemented using active elements in
feedback to achieve a controlled impedance. Other wideband techniques include
using feedback around the amplifier [3]. However, given that the signal is
inherently narrowband, another form of impedance transformation is available:
transformation using resonant circuits. This narrowband matching can be done
using either passive RLC circuits [1] or feedback around active circuits.
Furthermore, narrowband resonant matching has another advantage: the required input and
output noise-limiting filters can be folded in as part of the matching network.


3.1.3     Comparisons  of  Narrowband  and  Wideband  LNA 


Because of these two types of matching we can think of having two types of LNA.
One uses a narrowband amplifier, and the other uses a wideband amplifier.
The matching networks in these two types of LNA differ in their frequency
response, one having a wideband response and the other having a narrowband
response. Narrowband amplifiers have a tuned matching network at the input and
output. If the LNA is integrated with the following stage on chip, an explicit output
matching network can sometimes be omitted. Wideband amplifiers, on the other hand, need a
wideband matching network. The matching network should compensate for the
frequency response of the amplifier so that we can get a flat frequency response at
the output. For multistage wideband amplifiers, an interstage matching network is
needed.

What are the pros and cons of the two approaches? The wideband solution
consists of designing a general wideband amplifier first. We can then place a
bandpass filter at its output to achieve band selection. The advantage of this method
is that the design work is divided into two independent steps, making each task
more manageable, with fewer variables and constraints. A bandpass filter with an
accurate center frequency is easier to achieve because of the wideband
characteristics of the amplifier part. The problem with this method is that it
unnecessarily demands a wideband response from the amplifier, which makes the circuit
structure complicated and power hungry. Moreover, the circuit performance,
especially the noise performance, is poor. On the other hand, for the narrowband
approach, after proper narrowband impedance matching and low-noise optimization
at the input, not only can we get low noise performance but we can also cut
the DC power by a substantial amount. At the output we use an LC tank circuit to
peak the gain so that we can omit the additional gain stage and keep the circuit
simple. The disadvantage of the narrowband method lies in the difficulty of
achieving bandpass amplification with an accurate center frequency, due to the circuit
element value variation inherent in a very large scale integration (VLSI) process.
This can be taken care of by tuning the tank circuit on the chip. Also, there is a need
for low-loss inductors, which have only recently become available in modern VLSI processes.
The practical wideband and narrowband design approaches are discussed
in the following sections.


3.2     Wideband  LNA  Design 

Based  on  Table  2.1  of  Chapter  2,  we  present  a  sample  specification  of  LNA  in 
Table  3.1. 

Compared with Table 2.1, we have added a specification on S11 (for stability)
and a power specification. Also, to simplify the design we assume that IIP3,LNA does
not dominate IIP3,rec_front and hence is not considered. This LNA is supposed to
obtain its input from BPF1 and drive BPF2, as shown in Figure 2.1 of Chapter 2.

First, a wideband LNA design is examined. A wideband design is simpler, since
the filter and amplifier design can be decoupled. When we design a wideband
amplifier, we have to consider the following issues:

•  The frequency response, which is required to be flat within the specified
tolerance over the entire bandwidth

•  Low noise characteristics over the bandwidth

•  Input/output matching for maximum power transfer

•  Stability, which must be maintained throughout the entire bandwidth

Table 3.1    Specifications of LNA for DECT

    f0 = 1.9 GHz
    Input matching:  S11 < -10 dB
    Noise figure:    < 3 dB
    Voltage gain:    20 dB
    Power at 3.3 V:  40 mW

There are two ways to design a wideband amplifier. One is based entirely on the
S-parameter method, which is often used by microwave engineers. Alternatively, we
can use the lumped parameter representation, as shown in Figure 3.1b, as a starting
point for designing the core amplifier. This has the advantage of letting circuit
designers perform the design task using more familiar tools. Lumped parameters
tend to be simpler to apply and may lend better insight into the operation of the
circuits (e.g., feedback theory can be applied rather easily). The terminal behaviour
of the core amplifier is then expressed in S-parameters. The design of the
matching network can then be carried out using S-parameter-based techniques as
described in sub-section 3.1.2.

We  now  go  through  a  wideband  LNA  design  based  in  CMOS  step  by  step. 
The  design  procedure  is  a  synthesis  and  optimization  process.  In  this  design  we 
focus  briefly  on  the  design  of  the  matching  network  and  concentrate  on  the 
amplifier  design.  For  more  detail  on  matching  network  design,  refer  to  [1]. 

According to the specification, we first decide on a tentative amplifier circuit
topology. Figure 3.3a is one such topology. We select a matching network
that achieves the goals of optimum power delivery and stability.
At the input this matching is done by using a straightforward 50 Ω Rmatch (Rs is taken to
be 50 Ω). This is done purely for simplicity. Note that noise matching is not
performed. (As a side note, Rmatch itself introduces extra noise.) The matching at the
output is achieved by making the output resistance equal to 50 Ω. A pair of source
followers is needed to do this matching. The first buffer stage is smaller than the
second, to minimize loading on the transresistance pair; the second is designed such
that its output resistance, 1/gm, is equal to 50 Ω.

The core amplifier consists of two stages. The first stage is a simple common
source (CS) transconductance structure (M1), which is good for low-noise design.
This drives a tightly coupled transresistance amplifier with an active feedback
path. This second stage can also be looked at as a CS amplifier with active
shunt-shunt feedback. The feedback is introduced to modify the second-stage input and
output impedance. This helps to broadband the core amplifier by reducing the
Miller capacitance of M1. This proposed amplifier topology is similar to what is
done in conventional GaAs MMIC design [4]. As an example, we design the
amplifier in a 0.8 µm CMOS process. One thing to note is that even though the
output resistance of the transresistance stage is reduced due to feedback, it is still
difficult to make this output resistance (effectively 1/gm3) sufficiently low such that
it can achieve a 50 Ω match with BPF2. That is why Mbuf1 and Mbuf2 were used.

Fig. 3.3a   Wideband LNA


3.2.1     DC  Bias 


First, the DC bias of the entire topology is examined. One interesting bias problem
involves establishing the VGS of M1 and the associated drain current. Given that the
RF signal from the output of BPF1, Vs, is capacitively coupled into the amplifier at
this point, it is important to prevent Vs from being injected into the current bias
chains. This is accomplished by using an R-C-R filter, formed by Mra-Mcap-Mrb. Mra
and Mrb are biased in the triode region and have a W/L = 2.4 µm/5 µm. Their
resistances are calculated to be 8 kΩ. The center capacitor Mcap is implemented
using a 100 µm/100 µm FET gate. At AC, this R-C-R network forms a low-pass filter
and filters Vs before it hits the gate of Mdiode. At DC, it allows the DC bias
established at the gate of Mdiode to be copied to the gate of M1. This bias is
essentially set up by applying Vbias1 to Mbias, which in turn sets up ID(Mdiode), and
hence VGS(Mdiode). Vbias1 is also applied to M4 and M5. Due to different W/L ratios,
ID4 and ID5 are different from ID(Mdiode) (and hence ID1). In particular, ID4 and ID5 are set up
such that their difference becomes ID2, the bias current for M2. Vbias2 is used to set
up the bias current in Mbuf1 and Mbuf2.
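To get a feel for how strongly the R-C-R bias filter suppresses the RF signal, the following rough sketch estimates the corner frequency of a single R-C section built from the quoted 8 kΩ resistance and a 100 µm by 100 µm gate capacitor. It assumes a gate capacitance of 1 fF/µm² (the value quoted for this process in the design example) and ignores the second resistor and all loading, so the numbers are order-of-magnitude only.

```python
import math

R_OHM = 8e3                            # resistance of one triode-region FET (from the text)
GATE_W_UM, GATE_L_UM = 100.0, 100.0    # Mcap gate dimensions (from the text)
COX_FF_PER_UM2 = 1.0                   # assumption: gate capacitance per unit area

def rc_corner_hz(r_ohm, c_farad):
    """-3 dB corner of a single-pole RC low-pass filter."""
    return 1.0 / (2.0 * math.pi * r_ohm * c_farad)

def attenuation_db(freq_hz, corner_hz):
    """Magnitude response of a single-pole low-pass filter, in dB."""
    return -10.0 * math.log10(1.0 + (freq_hz / corner_hz) ** 2)

c_cap = GATE_W_UM * GATE_L_UM * COX_FF_PER_UM2 * 1e-15   # farads
f_corner = rc_corner_hz(R_OHM, c_cap)
atten_rf = attenuation_db(1.9e9, f_corner)
print(f"Mcap ~ {c_cap * 1e12:.1f} pF, corner ~ {f_corner / 1e6:.2f} MHz")
print(f"attenuation at 1.9 GHz ~ {atten_rf:.1f} dB")
```

With these assumed values the corner sits in the low-megahertz range, roughly three decades below the 1.9 GHz signal.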
Fig. 3.3b    The a circuit of core amplifier with loading

3.2.2     Gain  and  Frequency  Response 


Let  us  concentrate  on  the  amplifier  core.  The  coupling  capacitor  Cc  is  considered  a 
short  circuit  at  RF  and  so  Vin  is  applied  to  the  gate  of  M1.  Vrf3  is  the  output  voltage. 
First  let  us  review  the  derivation  of  the  low-frequency  small-signal  voltage  gain, 
neglecting  frequency-dependent  effects  from  device  capacitance. 


$$A_v = (\text{gain of 1st stage}) \cdot (\text{gain of 2nd stage}) \tag{3.7a}$$

From Figure 3.3a,

$$\text{gain of 1st stage} = g_{m1} \tag{3.7b}$$

$$\text{gain of 2nd stage} = \frac{V_{rf3}}{i_{M1}} \tag{3.7c}$$


Now, the gain of the second stage is the gain of a transresistance amplifier with
shunt-shunt feedback. From feedback theory,

$$\text{gain of 2nd stage} = \frac{a}{1 + af} \tag{3.8}$$

where a = forward gain with loading, and f = feedback factor.

To calculate a, let us refer to Figure 3.3b, which shows the forward amplifier with
loading from the feedback network (shown as the two M2 transistors with the drain
grounded at the input and output nodes). From the figure, iM1 flows into a
resistance of

$$\frac{1}{g_{m2}} \,\|\, R_{inM_3} = \frac{1}{g_{m2}}$$

(since RinM3, the input resistance of M3, is infinite).
This develops a voltage of iM1/gm2. This voltage is multiplied by the voltage gain of M3
to develop Vrf3. For this single-stage CS amplifier the voltage gain is simply gm3 rout,
where rout = ro3 || ro5 || rin2 = ro3 || ro5, as rin2 = ∞. Here ro3, ro5 are the output
resistances of M3, M5 and rin2 is the input resistance of M2. Now we
denote ro3 || ro5 = Rop and so

$$V_{rf3} = \left(\frac{i_{M1}}{g_{m2}}\right) g_{m3} R_{op} \tag{3.9}$$

Fig. 3.3c    The f circuit of core amplifier with loading


Therefore, from (3.9),

$$a = \frac{V_{rf3}}{i_{M1}} = \frac{g_{m3} R_{op}}{g_{m2}} \tag{3.9a}$$

This is very large, as Rop is large. Next let us look at Figure 3.3c and see that f is
defined as

$$f = g_{m2} \tag{3.9b}$$


Substituting (3.9a) and (3.9b) in (3.8), we note that since af is large,

$$\text{second stage gain} \approx \frac{1}{f} \tag{3.9c}$$

Substituting (3.9b) into (3.9c), we have

$$\text{second stage gain} = \frac{1}{g_{m2}} \tag{3.9d}$$

Substituting (3.7b) and (3.9d) into (3.7a), we have

$$A_v = \frac{g_{m1}}{g_{m2}} \tag{3.10}$$

Notice that the body effect loss has been neglected.
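The size of the error made by the approximation af >> 1 can be checked with a short calculation. The sketch below evaluates the feedback expression a/(1 + af) with a = gm3 Rop/gm2 and f = gm2, and compares the result with the ideal gain gm1/gm2. The numerical values are borrowed from Design Example 3.1 later in this chapter and are used here purely for illustration; the function names are hypothetical.

```python
def second_stage_gain(gm2, gm3, r_op):
    """Closed-loop transresistance a / (1 + a f) of the shunt-shunt stage."""
    a = gm3 * r_op / gm2      # forward gain with loading
    f = gm2                   # feedback factor
    return a / (1.0 + a * f)

def low_freq_gain(gm1, gm2, gm3, r_op):
    """Overall low-frequency voltage gain: first-stage gm times second-stage transresistance."""
    return gm1 * second_stage_gain(gm2, gm3, r_op)

# Illustrative values taken from the design example (assumptions for this sketch).
GM1, GM2, GM3, R_OP = 1 / 75.0, 1 / 750.0, 1 / 4300.0, 41.5e3

ideal = GM1 / GM2
actual = low_freq_gain(GM1, GM2, GM3, R_OP)
print(f"ideal gain gm1/gm2 = {ideal:.2f}")
print(f"gain with finite loop gain = {actual:.2f}")
```

With a loop gain of only about 10, the gain comes out roughly 10% below the ideal value, which suggests the approximation is usable but not tight for these device sizes.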

Next let us derive the high-frequency gain. Upon detailed derivation, the
high-frequency gain of the amplifier, including frequency effects due to device capacitance,
can be expressed approximately as [5]

$$A_v(j\omega) = g_{m1} \times \frac{g_{m3} - j\omega C_2}{(g_{m3} - j\omega C_2)(g_{m2} + j\omega C_2) + (g_{m2} + j\omega C_3)\left(\frac{1}{R_{op}} + j\omega C_4\right)} \tag{3.10a}$$

where

$$C_2 = C_{gs2} + C_{gd3} \tag{3.10b}$$

$$C_3 = C_{PD1} + C_{PD4} + C_{gs3} + C_{gd1} + C_{gs2} + C_{gd3} \tag{3.10c}$$

$$C_4 = C_{PD3} + C_{gd2} + 0.5\,C_{gs,buf1} + C_{PD5} + C_{gd5} + C_{gd3} \tag{3.10d}$$

$$R_{op} = r_{o3} \,\|\, r_{o5} \tag{3.10e}$$

It should be noted that capacitances C2 through C4 are the lumped parasitic
capacitances at each node in the circuit. CPD and CPS are the parasitic drain/source
capacitances. At low frequency (3.10a) reduces to (if Rop >> 1/gm3, which is almost
always true)

$$A_v = \frac{g_{m1}}{g_{m2}} \tag{3.11}$$

which agrees with (3.10).

The -3 dB frequency can be extracted from (3.10a):

$$\omega_{-3dB} = \frac{g_{m2} g_{m3} R_{op}}{(g_{m3} - g_{m2}) R_{op} C_2 + g_{m2} R_{op} C_4 + C_{PD1}} \tag{3.12}$$

which is equal to

$$\omega_{-3dB} = \frac{1}{\dfrac{C_4}{g_{m3}} + \dfrac{C_{PD1}}{g_{m2} g_{m3} R_{op}} + \left(\dfrac{1}{g_{m2}} - \dfrac{1}{g_{m3}}\right) C_2} \tag{3.13}$$
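The two forms of the bandwidth expression can be cross-checked numerically. The sketch below assumes that the first form has numerator gm2 gm3 Rop and denominator (gm3 - gm2) Rop C2 + gm2 Rop C4 + CPD1, and that the second form is obtained by dividing numerator and denominator by gm2 gm3 Rop. The capacitance values for C2 and C4 are hypothetical placeholders chosen only to exercise the formulas.

```python
def omega_3db_form_a(gm2, gm3, r_op, c2, c4, c_pd1):
    """Bandwidth in rad/s, written with gm2*gm3*Rop in the numerator."""
    num = gm2 * gm3 * r_op
    den = (gm3 - gm2) * r_op * c2 + gm2 * r_op * c4 + c_pd1
    return num / den

def omega_3db_form_b(gm2, gm3, r_op, c2, c4, c_pd1):
    """Same bandwidth, written as the reciprocal of a sum of time constants."""
    tau = c4 / gm3 + c_pd1 / (gm2 * gm3 * r_op) + (1.0 / gm2 - 1.0 / gm3) * c2
    return 1.0 / tau

def omega_dominant(gm2, gm3, r_op, c_pd1):
    """Dominant-pole estimate, keeping only the CPD1 term."""
    return gm2 * gm3 * r_op / c_pd1

# gm and Rop values follow the design example; C2 and C4 are hypothetical.
args = dict(gm2=1 / 750.0, gm3=1 / 4300.0, r_op=41.5e3, c2=5e-15, c4=10e-15, c_pd1=1052e-15)
w_a = omega_3db_form_a(**args)
w_b = omega_3db_form_b(**args)
print(f"form a: {w_a:.4e} rad/s, form b: {w_b:.4e} rad/s")
```

Setting C2 = C4 = 0 in either form recovers the dominant-pole estimate, which is the simplification used in the time-constant argument.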

Intuitively, this simplified expression agrees with the results from zero-valued
time-constant analysis. To see why this is the case, let us refer to Figure 3.3a again
and note that the frequency rolloff is controlled primarily by the parasitic
capacitances from the gate of M3 and the drain of M1 (both included in the CPD1
term). Hence the dominant pole is calculated by considering the drain node of M1.
This pole can be calculated by the zero-valued time-constant method as

$$p_1 = \frac{1}{2\pi R C} \tag{3.14}$$

where

$$C = C_{PD1} \tag{3.15}$$

and resistance R equals the input resistance of the shunt-shunt feedback pair (RinCL)
with a value given by

$$R_{inCL} = \frac{1}{g_{m2} g_{m3} R_{op}} \tag{3.16}$$

Substituting (3.15) and (3.16) into (3.14) gives a p1 that agrees with ω-3dB as
given in (3.13). Since RinCL is so important, let us go through the exercise and see
how (3.16) is derived. Let us refer again to Figure 3.3a. To calculate RinCL, the
closed-loop impedance looking into the M2, M3 shunt-shunt feedback loop, we first
calculate RinOL, the open-loop impedance looking into the M2, M3 pair, including
loading from the feedback path. This can be obtained by referring again to Figure
3.3b. As can be seen, RinOL is given by

$$R_{inOL} = \frac{1}{g_{m2}} \,\|\, R_{inM_3} \tag{3.17}$$

Since RinM3 = ∞, then

$$R_{inOL} = \frac{1}{g_{m2}} \tag{3.18}$$

From (3.9a) and (3.9b), we have

$$af = g_{m3} R_{op} \tag{3.19}$$

From feedback theory RinCL is obtained from RinOL by

$$R_{inCL} = \frac{R_{inOL}}{af + 1} \approx \frac{R_{inOL}}{af} \tag{3.20}$$

since af >> 1. Substituting (3.19) and (3.18) into (3.20), we have

$$R_{inCL} = \frac{1}{g_{m2} g_{m3} R_{op}} \tag{3.21}$$

Hence (3.16) is confirmed. This reduction of impedance is performed at the point
where the largest capacitance is hanging, namely, at the drain node of M1. This drain
node has a large parasitic capacitance CPD1 because (W/L) of M1 is large (this is, in
turn, done so as to reduce noise, as will be shown in sub-section 3.2.3). This exercise
highlights why shunt-shunt feedback is performed: mainly to reduce RinCL, which
increases p1 according to (3.14), and hence broadens the frequency response of the
whole circuit. Finally, returning to the full expression in (3.13), the (1/gm2 - 1/gm3)C2
correction term accounts for the pole splitting introduced by C2 shunted across the
gate and drain of M3.
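The effect of the feedback on the input resistance, and therefore on the dominant pole, can be seen with a few lines of arithmetic. The sketch below compares the open-loop resistance 1/gm2, the exact closed-loop value RinOL/(1 + af), and the approximation 1/(gm2 gm3 Rop), then evaluates the pole 1/(2πRC). The numerical values (gm2 = 1/750 Ω, gm3 Rop = 9.5, CPD1 = 1052 fF) are taken from the design example and serve only as an illustration.

```python
import math

def r_in_open_loop(gm2):
    """Open-loop input resistance of the M2/M3 pair (input of M3 assumed infinite)."""
    return 1.0 / gm2

def r_in_closed_loop(gm2, loop_gain):
    """Exact closed-loop input resistance: R_inOL / (1 + af)."""
    return r_in_open_loop(gm2) / (1.0 + loop_gain)

def r_in_closed_loop_approx(gm2, loop_gain):
    """Approximation for af >> 1: R_inOL / af."""
    return r_in_open_loop(gm2) / loop_gain

def pole_hz(r_ohm, c_farad):
    """Pole frequency of an RC time constant, in hertz."""
    return 1.0 / (2.0 * math.pi * r_ohm * c_farad)

GM2, LOOP_GAIN, C_PD1 = 1 / 750.0, 9.5, 1052e-15   # illustrative values

r_ol = r_in_open_loop(GM2)
r_cl = r_in_closed_loop(GM2, LOOP_GAIN)
r_cl_approx = r_in_closed_loop_approx(GM2, LOOP_GAIN)
print(f"R_inOL = {r_ol:.0f} ohm, R_inCL = {r_cl:.1f} ohm (approx {r_cl_approx:.1f} ohm)")
print(f"pole without feedback: {pole_hz(r_ol, C_PD1) / 1e6:.0f} MHz")
print(f"pole with feedback:    {pole_hz(r_cl_approx, C_PD1) / 1e9:.2f} GHz")
```

In this example the feedback moves the pole at the drain of M1 from roughly 200 MHz to about 1.9 GHz.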


3.2.3    Noise  Figure 

In this sub-section we will derive the NF of the amplifier core. From Chapter 2, noise
figure is defined as

$$NF = \frac{N_{dev} + N_{in}}{N_{in}} = 1 + \frac{N_{dev}}{N_{in}} \tag{3.22}$$

where

Ndev = noise coming from the amplifier core
Nin = noise from the source resistance Rs

Noise from the amplifier core is further broken down into noise coming from the
first and second stages. For the first stage the noise figure becomes the noise
figure of a MOS transistor, which is assumed to be dominated by thermal noise of
the drain current and is given by [2]

$$NF = 1 + \frac{4kT \cdot \frac{2}{3} \cdot \frac{1}{g_{m1}}}{4kT R_s} = 1 + \frac{2}{3 g_{m1} R_s} \tag{3.23}$$

Again, for a more complete picture of the NF of an MOS transistor, refer to [3].
Including the noise contribution from the second stage and the most prominent
frequency-dependent effect, we have (the detailed derivation is covered in problem 3.2)

$$NF = 1 + \frac{2}{3 g_{m1} R_s} + \frac{2 g_{m2}}{3 g_{m1}^2 R_s} + \frac{2 g_{m2}^2}{3 g_{m1}^2 g_{m3} R_s} + \left(\frac{2}{3 g_{m1} R_s} + \frac{2 g_{m2}}{3 g_{m1}^2 R_s} + \frac{2 g_{m2}^2}{3 g_{m1}^2 g_{m3} R_s}\right) \omega^2 C_{gs1}^2 R_s^2 \tag{3.24}$$

Note that noise from M1, when input referred, is also being shaped in the
frequency domain by Cgs1 and Rs. This is apparent in the (2/(3 gm1)) ω² Cgs1² Rs term in
(3.24). Obviously, we could also include the parasitic capacitance of M2 and M3, but
the equation for the noise figure would become rather messy and would lose its
usefulness.
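The noise expressions can be evaluated numerically to see how much each term contributes. The sketch below assumes the noise factor has the form 1 + A(1 + ω² Cgs1² Rs²), where A is the sum of the three low-frequency terms 2/(3 gm1 Rs), 2 gm2/(3 gm1² Rs) and 2 gm2²/(3 gm1² gm3 Rs). The transconductance values are borrowed from the design example, and the Cgs1 value is a hypothetical placeholder.

```python
import math

def first_stage_noise_factor(gm1, r_s):
    """Noise factor of M1 alone, drain thermal noise only."""
    return 1.0 + 2.0 / (3.0 * gm1 * r_s)

def noise_factor(gm1, gm2, gm3, r_s, c_gs1=0.0, freq_hz=0.0):
    """Noise factor including second-stage noise and shaping by Cgs1 and Rs."""
    a = (2.0 / (3.0 * gm1 * r_s)
         + 2.0 * gm2 / (3.0 * gm1 ** 2 * r_s)
         + 2.0 * gm2 ** 2 / (3.0 * gm1 ** 2 * gm3 * r_s))
    w = 2.0 * math.pi * freq_hz
    return 1.0 + a * (1.0 + (w * c_gs1 * r_s) ** 2)

def to_db(factor):
    """Convert a noise factor to a noise figure in dB."""
    return 10.0 * math.log10(factor)

GM1, GM2, GM3, R_S = 1 / 75.0, 1 / 750.0, 1 / 4300.0, 50.0   # illustrative values
C_GS1 = 1e-12                                                 # hypothetical Cgs1

f_first = first_stage_noise_factor(GM1, R_S)
f_dc = noise_factor(GM1, GM2, GM3, R_S)
f_rf = noise_factor(GM1, GM2, GM3, R_S, c_gs1=C_GS1, freq_hz=1.9e9)
print(f"first stage only: {to_db(f_first):.2f} dB")
print(f"both stages, low frequency: {to_db(f_dc):.2f} dB")
print(f"both stages, 1.9 GHz with assumed Cgs1: {to_db(f_rf):.2f} dB")
```

Under these assumptions the second-stage terms are not negligible, so sizing M1 from the first-stage expression alone should be read as a starting point.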

At  this  point  we  have  derived  all  the  relevant  design  equations.  Next  let  us  go 
through  a  design  example. 

Design Example 3.1.

We now illustrate how to design a wideband LNA using the topology shown in
Figure 3.3a, concentrating on the amplifier core. The specifications are given in
Table 3.1 and the underlying technology is selected to be 0.8 µm CMOS. The
equations in sub-sections 3.2.2 and 3.2.3 can now be used to help size the transistors
and currents. First we give some general philosophy on how to do the sizing.

M2 and M3 are necessarily small devices to minimize the impact of their parasitic
capacitance. Moreover, it is desirable that M2 be small, since the amplifier gain
increases with decreasing gm2 and hence decreasing (W/L)2 [see (3.10)]. However, M1
is necessarily a very large device, to maximize gain [see (3.10)] as well as to
minimize noise [see (3.23)]. Thus, the parasitic parameters and area capacitance at
its drain determine the overall frequency response of the amplifier. Next we go through
the design step by step.

Step 1: Designing M1 by determining ID1 and (W/L)1 from NF and power
specification

From Table 3.1 we set NF = 3 dB (a noise factor of 2). Putting this in (3.23) and
setting Rs = 50 Ω, we have

$$2 = 1 + \frac{2}{3 g_{m1} \cdot 50\,\Omega} \tag{3.25}$$

$$\frac{1}{g_{m1}} = 75\,\Omega \quad \text{or} \quad g_{m1} = 0.013\,\Omega^{-1} \tag{3.26}$$

Now gm1 is related to bias current ID1 and (W/L)1 as follows:

$$g_{m1} = \sqrt{2 I_{D1} k' \left(\frac{W}{L}\right)_1} = 0.013\,\Omega^{-1} \tag{3.27}$$

where k' = µCox.

We have one equation and two unknowns: ID1 and (W/L)1. To determine ID1 and
(W/L)1, we need one more equation, which we can get from the power specs.
To proceed we first observe that since noise is dominated by the first stage,
minimizing noise means gm1 >> gm2, gm3. This in turn implies ID1 >> ID2, ID3.
Thus, the power of the amplifier core is dominated by current ID1. Next, from
Figure 3.3a there are four branches (through Mdiode, M1, Mbuf1, Mbuf2). Let us
assume that they carry the same current. Hence the total current = 4 × ID1. Thus the
total power Ptotal is given as 4 × ID1 × VDD. It is clear, then, that with a VDD of 3.3
V and a power specification of 40 mW we have:

$$I_{D1} < \frac{P_{total}}{4 \times V_{DD}} = \frac{40\,\text{mW}}{4 \times 3.3\,\text{V}} = 3\,\text{mA} \tag{3.27a}$$

We put in some safety margin and set

$$I_{D1} = 1.2\,\text{mA} \tag{3.27b}$$

Substituting this back in equation (3.27), we get

$$0.013\,\Omega^{-1} = \sqrt{2 \times 1.2\,\text{mA} \times k' \left(\frac{W}{L}\right)_1} \tag{3.28}$$

Assuming k' = 100 µA/V² in our 0.8 µm CMOS process, this gives

$$\left(\frac{W}{L}\right)_1 = 731 \approx \frac{590\,\mu\text{m}}{0.8\,\mu\text{m}} \tag{3.29}$$
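The arithmetic of Step 1 can be reproduced with a short script, which also makes it easy to try other noise or power budgets. This is a minimal sketch: the helper names are hypothetical, the square-law relation gm = sqrt(2 k' ID W/L) is assumed, and the script uses the unrounded gm1 = 1/75 Ω, so the resulting W/L differs slightly from the rounded value quoted in the text.

```python
def gm_for_noise_factor(noise_factor, r_s):
    """gm1 needed so that 1 + 2/(3 gm1 Rs) equals the target noise factor."""
    return 2.0 / (3.0 * (noise_factor - 1.0) * r_s)

def max_branch_current(p_total, v_dd, branches=4):
    """Upper bound on the per-branch current if all branches carry equal current."""
    return p_total / (branches * v_dd)

def w_over_l(gm, i_d, k_prime):
    """Aspect ratio from the square-law relation gm = sqrt(2 k' I_D W/L)."""
    return gm ** 2 / (2.0 * k_prime * i_d)

R_S, V_DD, P_TOTAL = 50.0, 3.3, 40e-3    # from the specification table
K_PRIME = 100e-6                         # A/V^2, process value quoted in the text
I_D1 = 1.2e-3                            # chosen with safety margin, as in the text

gm1 = gm_for_noise_factor(2.0, R_S)
i_max = max_branch_current(P_TOTAL, V_DD)
wl1 = w_over_l(gm1, I_D1, K_PRIME)
print(f"gm1 = {gm1 * 1e3:.2f} mS (1/gm1 = {1 / gm1:.0f} ohm)")
print(f"ID1 must be below {i_max * 1e3:.2f} mA")
print(f"(W/L)1 = {wl1:.0f}, i.e. W = {wl1 * 0.8:.0f} um at L = 0.8 um")
```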

Step 2: Designing M2 by determining ID2 and (W/L)2 from gain specification

To determine gm2, let us go to (3.10). We know from Table 3.1 that Av = 20 dB or
10, and from step 1 gm1 = 0.013 Ω⁻¹. Substituting these in (3.10), the resulting
equation is as follows:

$$10 = \frac{g_{m1}}{g_{m2}} = \frac{0.013\,\Omega^{-1}}{g_{m2}}$$

Therefore, we have

$$g_{m2} = \frac{1}{750\,\Omega} \tag{3.30}$$

To achieve this gm2, which is 10 times less than gm1, we have two variables: ID2
and (W/L)2. Since gm is proportional to the square root of the product of ID and W/L,
this product must be reduced by a factor of 100. Some simple choices include
(1) setting ID2 to be ID1/100 and (W/L)2 = (W/L)1,
or (2) setting (W/L)2 = (1/100)(W/L)1 and ID2 = ID1, or (3) setting ID2 = ID1/10 and
(W/L)2 = (1/10)(W/L)1. We choose the third alternative. Therefore, ID2 = (1/10) ID1 = 0.12 mA
and (W/L)2 = 59 µm/0.8 µm. Among the simple choices the advantage of the third alternative
is that it ends up having both a small ID2 and a small (W/L)2. We want a small ID2 so that our
assumption in step 1, that ID2 << ID1, still holds. Hence the power specification is still
met. Also, we want (W/L)2 small because that means a small Cgs2. Hence C2 in (3.13)
remains small. Therefore, the correction term in (3.13), (1/gm2 - 1/gm3)C2 (due to pole
splitting), is also small and ω-3dB is large, which is a desirable feature. With the
third simple choice one can further optimise by varying ID2 and (W/L)2 as long as the
combination results in a gm2 that satisfies (3.30).
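Because gm scales as the square root of the product of drain current and aspect ratio, a tenfold reduction in gm requires a hundredfold reduction in that product. The sketch below checks three ways of splitting this reduction between ID2 and (W/L)2, reading the three options as (ID1/100, same W/L), (same ID, W/L divided by 100), and (both divided by 10). The square-law model and the helper name are assumptions of this sketch.

```python
import math

K_PRIME = 100e-6   # A/V^2, process value quoted in the text

def gm_square_law(i_d, w_over_l, k_prime=K_PRIME):
    """Square-law transconductance gm = sqrt(2 k' I_D W/L)."""
    return math.sqrt(2.0 * k_prime * i_d * w_over_l)

ID1, WL1 = 1.2e-3, 731.0   # first-stage values from step 1

options = {
    "(1) ID1/100, same W/L": (ID1 / 100.0, WL1),
    "(2) same ID, (W/L)/100": (ID1, WL1 / 100.0),
    "(3) ID1/10, (W/L)/10": (ID1 / 10.0, WL1 / 10.0),
}

gm1 = gm_square_law(ID1, WL1)
ratios = {name: gm_square_law(i_d, wl) / gm1 for name, (i_d, wl) in options.items()}
for name, ratio in ratios.items():
    i_d, wl = options[name]
    print(f"{name}: ID2 = {i_d * 1e3:.3f} mA, (W/L)2 = {wl:.1f}, gm2/gm1 = {ratio:.3f}")
```

All three options give the same gm2; they differ only in how much current and gate area (and hence capacitance) M2 ends up with.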

Step 3: Designing M3 by determining ID3 and (W/L)3 from f0 specification

From Table 3.1, f0 = 1.9 GHz. To pass the signal, f-3dB should be larger than 1.9
GHz. Here, to simplify calculations we set f-3dB = 1.9 GHz.
f-3dB is obtained from (3.13) and is given as

$$f_{-3dB} = \frac{g_{m2} g_{m3} R_{op}}{2\pi C_{PD1}} \tag{3.31}$$

We know gm2 from step 2 [see (3.30)]. Next let us find CPD1, the parasitic
capacitance at the drain of M1. Assuming CPD1 is dominated by the parasitic
capacitance of the large transistor M1, we have

$$C_{PD1} = C_{gd1} + C_{db1} \tag{3.32}$$

However, Cgd1 is the drain overlap capacitance of M1 and is given
by Cgd1 = W · LD · Cox. Assume LD = lateral diffusion = 0.12 µm and Cox = 1
fF/µm² in our 0.8 µm CMOS process. Then

$$C_{gd1} = W \cdot L_D \cdot C_{ox} = 590\,\mu\text{m} \times 0.12\,\mu\text{m} \times 1\,\text{fF}/\mu\text{m}^2 = 70.8\,\text{fF} \tag{3.33}$$

Cdb1 is the drain-to-substrate capacitance and is given by

$$C_{db1} = \frac{C_{db0}}{\sqrt{1 + V_{db}/\psi_0}} \tag{3.34}$$

$$C_{db0} = C_{jsw0} \times \text{drain perimeter} + C_{j0} \times \text{drain area} \tag{3.35}$$

Here, Cjsw0 and Cj0 are the (zero bias sidewall capacitance)/µm and the (zero bias
junction capacitance)/µm² and are given in our 0.8 µm CMOS process to be 1 fF/µm
and 0.18 fF/µm² respectively. Also, we can assume that the design rule specifies the
height of the drain area to be 9 µm. Substituting this information into (3.35), we have

$$C_{db0} = 1\,\text{fF}/\mu\text{m} \times (590\,\mu\text{m} + 9\,\mu\text{m} + 9\,\mu\text{m}) + 0.18\,\text{fF}/\mu\text{m}^2 \times (590\,\mu\text{m} \times 9\,\mu\text{m}) = 608\,\text{fF} + 956\,\text{fF} = 1564\,\text{fF} \tag{3.36}$$

Furthermore, we note that

$$V_{db} = V_{gs3} = (\text{overdrive voltage of } M_3) + V_T \tag{3.36a}$$

Since we want to save power, we want to make ID3 as small as possible.
Consequently, we make the assumption that ID3 is small enough that the first term
in (3.36a) can be neglected. Hence,

$$V_{db} \approx V_T = 1\,\text{V} \tag{3.36b}$$

(VT = 1 V is given in our 0.8 µm CMOS process). Finally, ψ0 is also given in our
0.8 µm CMOS process to be

$$\psi_0 = 0.65\,\text{V} \tag{3.37}$$

Hence, substituting (3.36), (3.36b), and (3.37) into (3.34), we get

$$C_{db1} = 981\,\text{fF} \tag{3.38}$$

Substituting (3.33) and (3.38) into (3.32), we have

$$C_{PD1} = 981\,\text{fF} + 70.8\,\text{fF} = 1052\,\text{fF} \tag{3.39}$$
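The capacitance bookkeeping for the drain node of M1 is easy to get wrong by hand, so a short script that repeats it may be useful. The sketch below uses the process numbers quoted in the text (LD = 0.12 µm, Cox = 1 fF/µm², Cjsw0 = 1 fF/µm, Cj0 = 0.18 fF/µm², drain height 9 µm, Vdb = 1 V, ψ0 = 0.65 V) and follows the text in taking the drain perimeter as W plus the two 9 µm sides. Variable names are hypothetical.

```python
import math

W_UM = 590.0        # width of M1
LD_UM = 0.12        # lateral diffusion
COX = 1.0           # fF/um^2
CJSW0 = 1.0         # fF/um, zero-bias sidewall capacitance
CJ0 = 0.18          # fF/um^2, zero-bias junction capacitance
DRAIN_H_UM = 9.0    # height of the drain area
V_DB = 1.0          # V, drain-to-bulk reverse bias (taken equal to VT)
PSI0 = 0.65         # V, built-in potential

c_gd1 = W_UM * LD_UM * COX                          # overlap capacitance, fF
perimeter = W_UM + 2.0 * DRAIN_H_UM                 # as used in the text
c_db0 = CJSW0 * perimeter + CJ0 * W_UM * DRAIN_H_UM # zero-bias junction capacitance, fF
c_db1 = c_db0 / math.sqrt(1.0 + V_DB / PSI0)        # bias-dependent reduction
c_pd1 = c_gd1 + c_db1                               # total at the drain of M1, fF

print(f"Cgd1 = {c_gd1:.1f} fF")
print(f"Cdb0 = {c_db0:.0f} fF, Cdb1 = {c_db1:.0f} fF")
print(f"CPD1 = {c_pd1:.0f} fF")
```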

The preceding calculations highlight that for large devices, the source and the
drain-to-substrate capacitances become important. We can now substitute (3.39)
and (3.30) into (3.31) and solve for the gm3Rop that will achieve f-3dB = 1.9 GHz:

$$f_{-3dB} = 1.9\,\text{GHz} = \frac{g_{m2} g_{m3} R_{op}}{2\pi C_{PD1}} = \frac{g_{m3} R_{op}}{750\,\Omega \times 2\pi (1052\,\text{fF})} \;\Rightarrow\; g_{m3} R_{op} = 9.5 \tag{3.39a}$$

Assume that ro3 << ro5; then from (3.10e) Rop ≈ ro3. Substituting this into
(3.39a), we have

$$g_{m3} r_{o3} = 9.5 \tag{3.39b}$$

Next we calculate ro3. To calculate ro3 we need ID3. As stated previously, since
we want to have a limited power consumption, we want a small ID3 [actually
to solve for (3.36a) we have set ID3 ≈ 0]. One simple choice of ID3 to achieve
this is to make ID3 approximately equal to ID2 (= 0.12 mA, obtained from step 2).
With this ID3 we can calculate ro3:

$$r_{o3} = \frac{1}{\lambda I_{D3}} = \frac{1}{(0.2\,\text{V}^{-1})(0.12\,\text{mA})} \approx 41.5\,\text{k}\Omega \tag{3.40}$$

(the channel length modulation factor λ is given as 0.2 V⁻¹ in our 0.8 µm CMOS
process).

Finally, we can substitute (3.40) in (3.39b), which gives us

$$g_{m3} = \frac{9.5}{r_{o3}} = \frac{9.5}{41.5\,\text{k}\Omega} = \frac{1}{4.3\,\text{k}\Omega} \tag{3.40a}$$

Consequently, with ID3 = 0.12 mA and gm3 = 1/(4.3 kΩ), (W/L)3 can now be
determined:

$$\frac{1}{4.3\,\text{k}\Omega} = \sqrt{2 \times 0.12\,\text{mA} \times k' \times \left(\frac{W}{L}\right)_3} \;\Rightarrow\; \left(\frac{W}{L}\right)_3 = 2.2 \approx \frac{1.8\,\mu\text{m}}{0.8\,\mu\text{m}} \tag{3.40b}$$
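The chain of calculations in Step 3 can be repeated in a few lines, which makes the sensitivity to the assumed bias current and channel-length modulation factor easy to explore. This sketch uses unrounded intermediate values, so its results (a required gm3 Rop near 9.4 and a W/L near 2.1) differ slightly from the rounded figures in the text. The helper names are hypothetical and the square-law model is assumed.

```python
import math

F_3DB = 1.9e9        # Hz, target bandwidth
GM2 = 1.0 / 750.0    # S, from step 2
C_PD1 = 1052e-15     # F, capacitance at the drain of M1
LAMBDA = 0.2         # 1/V, channel-length modulation factor
I_D3 = 0.12e-3       # A, chosen equal to ID2
K_PRIME = 100e-6     # A/V^2

def required_loop_gain(f_3db, gm2, c_pd1):
    """gm3*Rop needed so that gm2*gm3*Rop / (2 pi CPD1) equals f_3db."""
    return 2.0 * math.pi * f_3db * c_pd1 / gm2

def output_resistance(lam, i_d):
    """Small-signal output resistance ro = 1 / (lambda * I_D)."""
    return 1.0 / (lam * i_d)

gm3_rop = required_loop_gain(F_3DB, GM2, C_PD1)
r_o3 = output_resistance(LAMBDA, I_D3)
gm3 = gm3_rop / r_o3                           # taking Rop ~ ro3
wl3 = gm3 ** 2 / (2.0 * K_PRIME * I_D3)        # square-law sizing

print(f"required gm3*Rop = {gm3_rop:.2f}")
print(f"ro3 = {r_o3 / 1e3:.1f} kohm, gm3 = 1/({1 / gm3 / 1e3:.2f} kohm)")
print(f"(W/L)3 = {wl3:.2f}")
```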

Step 4: Checking that S11 is satisfied

Finally, by putting Rmatch = 50 Ω at the input, we have transformed Rin (= ∞) to
R'in_stable (= 50 Ω). Hence, assuming that Z0 = 50 Ω, from (3.2c) S11 = 0, and the
specs on S11 are satisfied.

At this point we have finished our design of this wideband LNA.


3.3     Narrowband LNA: Impedance Matching

In  this  and  the  following  section  we  look  at  the  principle  of  narrowband  LNA  design. 
The  design  is  again  based  in  CMOS.  Since  the  signal  occupies  a  narrow  bandwidth, 
we  only  need  to  provide  impedance  matching  as  well  as  amplification  in  this  narrow 
bandwidth.  This  can  be  done  rather  effectively  with  the  principle  of  resonance. 
Here  the  reactive  part  of  the  impedance  is  controlled  to  be  nulled  out  at  the  resonance 
frequency,  leaving  only  the  resistive  part  to  be  matched  to  the  resistive  source 
resistance  Rs.  Since  the  amplification  factor  is  a  product  of  the  device  transcon- 
ductance  and  load  impedance,  when  the  load  impedance  is  controlled  to  peak  up  via 
the  resonance  process,  the  amplification  factor  peaks  up  as  a  result. 

There are various ways of achieving resonance. One simple way is to use an LC
tank circuit. This approach has an added advantage in a practical LNA because some
parasitic capacitance already exists in the circuit. The idea, of course, is to add
some extra inductance and combine it with this parasitic capacitance, hence achieving
resonance. This is especially attractive at RF, since inductance in integrated form
and with high enough Q becomes available. This approach basically makes good use
of the undesirable parasitic capacitance inherent in the circuit by tuning it out.
This is just another way of saying that one makes the resulting capacitance/inductance
network possess a zero impedance at the resonance frequency (a consequence
of resonance) and as a result the parasitic capacitance is rendered harmless. In this
section we explain the behaviour of the impedance of this network under resonance.
As in Section 3.2, we design the matching network for power and stability
considerations. We further assume that this will give us close to
optimum noise performance. How do we design the matching network for the
present amplifier? First let us revisit the idea of power matching. In sub-section
3.1.2.3 we explained the concept of matching and how it can be achieved by making
Rin = Rs. The concept was then applied to the design of a wideband LNA for a DECT
application, whereby, to achieve an impedance match across the entire 1.9 GHz
band, a real physical 50 Ω resistor Rmatch is placed at the input. In the present case,
we only need to achieve an impedance match in a narrow band around the input
RF (or around the carrier frequency ωc). Hence the idea is to make the input
impedance look like a 50 Ω resistor around this ωc. On the other hand, since the
input impedance is not simply resistive, this complicates the matter because we
have to match both the real part and the imaginary part (the reactance). To satisfy
power matching, the condition as specified in (3.6) is modified to Zin = Zs.


3.3.1     Matching  the  Imaginary  Part 

Let us find out what is involved in matching the reactive part of the impedance.
To achieve matching, the LNA's input impedance Zin is selected to have a conjugate
match with the source resistance, whose imaginary part is zero. First let us try to do
matching by placing an inductor L in front of a MOS transistor [Figure 3.4a], whose
equivalent circuit is shown in Figure 3.4b. The Cgs of M1 is relabelled C. It can be
shown that the series LC circuit produces a narrowband output characteristic and
eliminates the need for a separate filter. However, the LC circuit always generates
an imaginary part in Zin and no matching is possible for a given ωc. An improved
circuit, where the inductor is moved to degenerate the source, is shown in Figure
3.5a. Figure 3.5b is its small-signal equivalent circuit. Phasor representation is used
for the voltage and current. As shown in Figure 3.5b, the input impedance Zin is
solved by writing KVL in its phasor form across the input loop:

$V_{in} = I_{in}(j\omega L) + I_{in}\frac{1}{j\omega C} + I_o\, j\omega L$  (3.41)

Fig. 3.4a    LNA using inductor at the gate for matching

Fig. 3.4b    Small-signal equivalent circuit of LNA using inductor at the gate for matching

Fig. 3.5a    LNA using source degenerated inductor for matching

At the output loop

$I_o = g_m V_{gs} = g_m I_{in}\frac{1}{j\omega C}$  (3.42)

Substituting (3.42) into (3.41),

$V_{in} = I_{in}\left[j\omega L + \frac{1}{j\omega C} + \frac{g_m L}{C}\right]$  (3.43)

Fig. 3.5b    Small-signal equivalent circuit of LNA using source degenerated inductor for matching

Therefore

$Z_{in} = \frac{V_{in}}{I_{in}} = j\omega L + \frac{1}{j\omega C} + \frac{g_m L}{C}$  (3.44)
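The behaviour of (3.44) can be checked numerically. The following is a minimal sketch, not a design procedure: the component values are hypothetical and were picked only so that gm L/C comes out to 50 Ω, and the function name is an illustrative choice.

```python
import math


def zin_source_degenerated(freq_hz, L, C, gm):
    """Input impedance of the source-degenerated stage, per (3.44)."""
    w = 2 * math.pi * freq_hz
    return 1j * w * L + 1 / (1j * w * C) + gm * L / C


# Hypothetical component values, chosen only so that gm*L/C = 50 ohm.
L = 1e-9      # source inductance, H
C = 0.4e-12   # gate-source capacitance, F
gm = 20e-3    # device transconductance, S

f_res = 1 / (2 * math.pi * math.sqrt(L * C))   # series LC resonance frequency

# Below resonance the reactance is capacitive, above it is inductive,
# and at resonance only the real term gm*L/C remains.
for f in (0.9 * f_res, f_res, 1.1 * f_res):
    z = zin_source_degenerated(f, L, C, gm)
    print(f"f = {f / 1e9:6.2f} GHz  Re(Zin) = {z.real:6.2f}  Im(Zin) = {z.imag:8.3f}")
```

The real part stays at gm L/C for every frequency, while the imaginary part changes sign as the frequency passes through resonance.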


What is surprising from the preceding equation is that we can actually get a real
term, gm L/C, out of only reactive components: L and C. The mystery of this actually
lies in the voltage-controlled current source (the gm source). To understand this better,
let us first redraw Figure 3.5b in Figure 3.6a. The phasor diagram of the currents and
the voltages in Figure 3.6a is then drawn in Figure 3.6b. Figure 3.5b has been redrawn
in Figure 3.6a by viewing Iin as consisting of two components: IL and −Io. Let us start
off by assuming that this input phasor Iin has an angle of 90°, as shown in Figure 3.6b.
Referring back to Figure 3.6a, this current flows through capacitor C and generates a
voltage Vgs that is 90° lagging [as shown in Figure 3.6b]. This voltage Vgs generates an
in-phase current Io via the voltage-controlled current source. Therefore Iin generates an
Io that is 90° out of phase. If we revisit Figure 3.6a and apply KCL at the source node, it
can be seen that Iin consists of a 90° out-of-phase component −Io and a second
component IL. This second component of the current in Iin is responsible for generating
a voltage across C, which we denote as Vc. This same current flows from C into L and
generates a voltage across L, which we denote as VL. The idea is to make these two
voltages equal in magnitude so that they cancel one another. When this happens we call
the condition the resonance condition. Then the voltage across C and L comes only from
the voltage developed across C due to −Io. This Io current generates a voltage that is
90° out of phase again (by flowing through a capacitor, which shifts phase by 90°) and
now becomes in phase with the incoming Iin. Thus the impedance looks resistive and the
imaginary part is zero.

Fig. 3.6a    Alternative viewpoint of small-signal equivalent circuit

Fig. 3.6b    Phasor diagram showing Iin and Vin in phase under resonance condition. This means
input impedance becomes resistive

To highlight this condition mathematically, we start off by applying KVL to the
input loop of Figure 3.6a and write

$V_{in} = \frac{I_{in}}{j\omega C} + I_L\, j\omega L = \frac{I_L - I_o}{j\omega C} + I_L\, j\omega L = -\frac{I_o}{j\omega C} + \left[\frac{I_L}{j\omega C} + I_L\, j\omega L\right]$  (3.45)

Fig. 3.6c    Phasor diagram showing a situation when Vin < Vgs. Under this situation the resonant
circuit amplifies Vin by a factor Q to obtain Vgs. Q, the quality factor, becomes the voltage gain of
the matching network

Under resonance condition the two terms in the bracket cancel out. Therefore,

$V_{in} = -\frac{I_o}{j\omega C}$  (3.46)

But from (3.42) we have $I_o = g_m V_{gs} = g_m \frac{I_{in}}{j\omega C}$; therefore, Io and Iin are 90° out of
phase.

Substituting this in (3.46), $V_{in} = \frac{g_m}{\omega^2 C^2} I_{in}$ and the impedance indeed looks resistive.

We can also look at the resistive interpretation in an alternative way. As stated
before, Figure 3.5b was redrawn in Figure 3.6a to highlight that Iin contains
two components: IL and −Io. Let us interpret these two components in an alternate
manner. From Figure 3.6a,

$I_o = g_m V_{gs} = g_m I_{in}\frac{1}{j\omega C}$  (3.47)

That is, Iin generates an Io that is 90° out of phase (Io is 90° lagging), as shown in
Figure 3.6b. If we can further make Io generate a Vin that is 90° leading, then the
phasor will be rotated back and Vin, Iin will be in phase again. Consequently, Zin,
which by definition equals $\frac{V_{in}}{I_{in}}$, is in phase or is resistive.

To see how we can make Io generate a Vin that is 90° leading, let us return to Figure
3.6a and observe that Io flows into the capacitor C and generates a voltage $-\frac{I_o}{j\omega C}$. As
shown in Figure 3.6b, this rotates the phasor Io counterclockwise by 90° and hence it will
be in phase with Iin. If this is the only voltage component of Vin, the resistive input
condition is already met (i.e., Zin already looks resistive). However, from Figure 3.6a it
is seen that Vin contains other voltage components. First, the current IL also flows into
C, generating a voltage $V_c = \frac{I_L}{j\omega C}$, whose phasor representation is shown in Figure
3.6b. This can be interpreted as the reactive component of Vgs. Second, the same
current IL also flows into the inductor L, generating a voltage $V_L = I_L\, j\omega L$, whose
phasor representation is shown in Figure 3.6b. Fortunately, the capacitor rotates the IL
phasor clockwise by 90° to generate Vc and the inductor rotates this same IL phasor in
the opposite direction (i.e., counterclockwise by 90°) to generate VL. Hence they will
be pointing in opposite directions and cancelling each other. If the cancellation is
exact, then these other voltage components in Vin do not matter and, as discussed
previously, Zin does look resistive. It turns out that since the two voltage components
have the same current phasor, IL, this cancellation can be achieved by simply making
$\frac{1}{j\omega C} = -j\omega L$. Of course, this occurs when $\omega_c = \frac{1}{\sqrt{LC}}$, or at resonance.

In summary, as shown in Figure 3.6a, the input current phasor Iin is converted to a
voltage phasor Vgs that is rotated clockwise by 90° by a capacitor (undesired phase
conversion). This capacitor voltage turns out to be a control voltage and hence is
converted to a current Io (essential operation) by the voltage-controlled current
source. The current phasor is converted back to a voltage phasor that is rotated
counterclockwise by 90° (correction) by making the current flow through a
capacitor. This is made the sole component of Vin by cancelling the other components
in Vin, which is achieved by forcing the remaining current component in the current
phasor to flow through an inductor. Upon cancellation, the original input current
phasor Iin goes through two equal and opposite phase conversions, 90° clockwise
followed by 90° counterclockwise, and generates a Vin that is in phase.
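The phasor argument can be traced step by step with complex arithmetic. This is only an illustrative sketch: the component values and the 1 mA input current are hypothetical, and the variable names simply mirror the symbols used for Figure 3.6a.

```python
import cmath
import math

# Hypothetical values for illustration only.
L, C, gm = 1e-9, 0.4e-12, 20e-3
w = 1 / math.sqrt(L * C)            # operate exactly at resonance

I_in = 1j * 1e-3                    # input current phasor at 90 degrees (1 mA)
V_gs = I_in / (1j * w * C)          # capacitor rotates I_in clockwise by 90 degrees
I_o = gm * V_gs                     # controlled source: I_o is in phase with V_gs
I_L = I_in + I_o                    # KCL at the source node (I_in = I_L - I_o)

V_c = I_L / (1j * w * C)            # voltage that I_L develops across C
V_L = I_L * (1j * w * L)            # voltage that I_L develops across L
V_in = -I_o / (1j * w * C) + (V_c + V_L)   # the decomposition in (3.45)

for name, ph in (("I_in", I_in), ("V_gs", V_gs), ("I_o", I_o), ("V_in", V_in)):
    print(f"{name}: magnitude {abs(ph):.4g}, angle {math.degrees(cmath.phase(ph)):.1f} deg")
```

At resonance V_c and V_L cancel, so V_in ends up at the same angle as I_in and their ratio is the real number gm L/C.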

Finally, it should be noted that the input resistance is only an ac resistance (i.e., it
does not generate noise). Therefore, we can terminate Rs with this input resistance
without introducing thermal noise [contrast this with the case depicted in
Figure 3.3a]. In return, we suffer from the potential disadvantage that the matching
occurs only at one frequency. Fortunately, the matching is nearly perfect even when
we look at a narrow band around this frequency. Since the signal is narrowband
enough, we can assume that matching is essentially achieved.


3.3.2    Matching  the  Real  Part 

The preceding sub-section describes only half of the matching condition: it simply
means that the input impedance's reactive component is zero, which matches
Rs's reactive component (which of course is zero). We still have to match the real
part. Under resonance condition what is the real part? From (3.44) this is gm L/C.
Accordingly, for matching we set Rs equal to this. In real life the inductance L can be
implemented using a spiral inductor (to be covered later). However, the inductance
available from the spiral inductor structure is on the order of 2 to 5 nH and may not be
sufficient to satisfy the aforementioned matching criteria. Therefore, it is customary
to combine the circuits in Figure 3.4a and Figure 3.5a, which gives the circuit shown in
Figure 3.7a. Here L2 is the same as L before and is implemented using the spiral inductor.

Fig. 3.7a    LNA using inductor at the gate and source degenerated inductor to achieve matching

Fig. 3.7b    Small-signal equivalent circuit of LNA using L1, L2 for matching

L1 is implemented using the bond wire inductor. The reason for the choice of this inductor
will be covered later. Figure 3.7b is the small-signal equivalent circuit of Figure
3.7a. Applying KVL to the input loop in Figure 3.7b, we have


$V_{in} = I_{in}(j\omega L_1 + j\omega L_2) + I_{in}\frac{1}{j\omega C} + I_o\, j\omega L_2$  (3.48)

Independently,

$I_o = g_m V_{gs} = g_m I_{in}\frac{1}{j\omega C}$  (3.49)

Substituting (3.49) in (3.48),

$V_{in} = I_{in}\left[j\omega(L_1 + L_2) + \frac{1}{j\omega C} + \frac{g_m L_2}{C}\right]$  (3.50)

Therefore, Zin can be calculated as

$Z_{in} = \frac{V_{in}}{I_{in}} = j\omega(L_1 + L_2) + \frac{1}{j\omega C} + \frac{g_m L_2}{C}$  (3.51)

For matching, Zin = Rs and so

$\omega_c(L_1 + L_2) = \frac{1}{\omega_c C}$ or $(L_1 + L_2)C = \frac{1}{\omega_c^2}$  (3.52)

and

$R_s = g_m\frac{L_2}{C}$  (3.53)

From the preceding equations it can be seen that matching occurs only at one
frequency ωc. This is also the resonant frequency of the circuit formed by L1 + L2 and C.
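The two matching conditions (3.52) and (3.53) can be turned into a small sizing calculation. The sketch below is illustrative only: the function name is an assumption, and the values of gm and C are hypothetical placeholders rather than figures taken from the text.

```python
import math


def match_inductors(f_c, r_s, g_m, c):
    """Solve (3.53) for L2 and (3.52) for L1, given the device gm and C."""
    w_c = 2 * math.pi * f_c
    l2 = r_s * c / g_m               # real part:      Rs = gm * L2 / C
    l_total = 1 / (w_c ** 2 * c)     # imaginary part: (L1 + L2) * C = 1 / w_c^2
    return l_total - l2, l2


# Hypothetical operating point for illustration.
f_c = 1.9e9     # carrier frequency, Hz
R_s = 50.0      # source resistance, ohm
g_m = 20e-3     # device transconductance, S
C = 1e-12       # gate-source capacitance, F

L1, L2 = match_inductors(f_c, R_s, g_m, C)
print(f"L1 = {L1 * 1e9:.2f} nH, L2 = {L2 * 1e9:.2f} nH")

# Cross-check against (3.51): Zin should be purely real and equal to Rs at f_c.
w_c = 2 * math.pi * f_c
z_in = 1j * w_c * (L1 + L2) + 1 / (1j * w_c * C) + g_m * L2 / C
print(f"Zin at f_c = {z_in.real:.2f} + j{z_in.imag:.2e} ohm")
```

A negative L1 from this calculation would indicate that the chosen gm and C cannot be matched at f_c with this topology, since L2 alone already exceeds the total inductance that resonance allows.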


3.3.3    Interpretation  of  Power  Matching 


In  the  preceding  sub-section  we  explained  power  matching  from  a  phasor 
cancellation  point  of  view.  The  condition  leads  to  the  resonating  condition  of  the 
LC  tank  network.  In  this  sub-section  we  explain  power  matching  from  a  resonance 
point  of  view.  To  explain  how  resonance  helps  save  power,  let  us  borrow  an  idea 
from  the  ac  power  generator  discipline:  the  use  of  resonance  to  improve  the  load's 
power  factor.  As  in  the  power  generator  case,  we  want  to  maximize  the  power 
transfer  to  our  load,  the  capacitor  C.  However,  C  is  a  storage  element.  This  means 
that  if  we  apply  a  sinusoidal  signal  to  this  element,  it  will  take  in  energy  from  the 
source  in  the  first  half-cycle  and  release  energy  back  to  the  source  in  the  second 
half-cycle.  This  means  energy  (or  current)  is  shuffled  back  and  forth  from  the 
source  and  is  not  contributing  to  anything  useful,  except  burning  power  in  the 
source  resistance  Rs.  To  remedy  this  situation,  one  approach  is  to  provide  a  local 
energy  storage  element  that  acts  in  a  complementary  (push-pull)  fashion  to  that  of 
the capacitor. One convenient choice is an inductor that releases energy in the first
half-cycle (when the capacitor absorbs energy) and absorbs energy in the next
half-cycle (when the capacitor releases energy). If the capacitor and inductor act in
unison, that is, if the release of energy from one element is synchronized with the
absorption of energy by the other element, then all the energy transfer is local, and
no energy transfer occurs between the source and the elements. This is another
interpretation of resonance, where we can see that as far as the source is concerned,
all energy transfer is local. Hence if we put a voltage source across the LC tank, no
energy is delivered from the voltage source to the LC tank, which also means no
current flows from the voltage source to the tank. Zero current delivered for a finite
voltage source means impedance $Z_{in} = \frac{V}{I} = \frac{V}{0} = \infty$, which agrees with the
customary interpretation of resonance (an LC tank in parallel is represented by Zin = ∞ at
resonance). To summarize, putting an inductor there helps to null out the reactance due
to the capacitance. This results in minimum power wasted in Rs. Therefore, of the
power delivered, the maximum amount goes to C, the input of the device.

We can gain further insight into power matching by comparing our present case
to that in sub-section 3.1.2.3. As a reminder, in sub-section 3.1.2.3, we talked about
the concept of matching in the case where R'in_power is real at all frequencies.
We want to re-explore the interpretation in the present case, where R'in_power is
actually complex except at ωc, when it becomes real. We will find the interpretation
in sub-section 3.1.2.3 to be equally applicable in this more general case. At first
glance, however, it seems that as opposed to the case in sub-section 3.1.2.3, where
we use a transformer to do matching, the present case does not give us the
maximum power output. We show how we may reach this erroneous conclusion
and then show the correct interpretation.

First, to get us to the misleading conclusion, let us compare Figure 3.5a with
Figure 3.5b. In Figure 3.5a, where M1 is not inductor degenerated, we assume that
Cgs is an open circuit at ωc. We further assume that Cc is a short at ωc.
Hence $V_{gs} = \frac{V_s}{2}$. In Figure 3.5b, under matching condition, we have
$R_s = \mathrm{Re}(Z_{in}) = g_m\frac{L}{C}$ and so $V_{in} = \frac{1}{2}V_s$. When we apply KVL to the input loop in
Figure 3.5b then Vin = Vgs + VL. At first glance, we conclude from this equation that
|Vin| > |Vgs|. Then $|V_{gs}| < \frac{1}{2}|V_s|$. That means |Vgs| in Figure 3.5b is smaller than |Vgs| in
Figure 3.5a. Since the same transistor M1 is used in both cases, they have the
same gm, and therefore |Io| in Figure 3.5b is smaller than |Io| in Figure 3.5a, leading
to a smaller output power, which is the erroneous conclusion. This conclusion is
misleading because our assumption that |Vin| > |Vgs| is wrong. At first glance this
seems impossible, since Vin = Vgs + VL. However, the preceding sum is a vector
sum, not a scalar sum. This is the major difference between the present case and that
described in sub-section 3.1.2.3. To see how |Vgs| > |Vin| is possible, let us refer to
Figure 3.6c, where a small Vin generates a large Iin (by setting $R = \frac{V_{in}}{I_{in}} = g_m\frac{L}{C}$
small). The large Iin generates a large Vgs (by setting ωcCgs small). Then the
large Vgs generates a small Io (by setting a small gm). Since Io is small compared
with Iin, IL is almost vertical. Now VL is leading IL by 90°; therefore, VL is
almost horizontal and pointing in almost the opposite direction from Vgs.
This means that their vector sum, Vin, is very small. To summarize, this means
that even for a small Vin, by setting the proper component values, there can exist a
large Vgs because VL can be made large and pointing in the opposite direction from
this Vgs. This will lead to a large Vgs and hence large Io or large output power.
This is, of course, nothing but saying that in a circuit involving LC [Figure 3.5b
being an example], at resonance, the voltage (or current) across the C (or through L)
can be much larger than the applied voltage (or current) (actually Q times larger,
Q being the quality factor). This is well known in a passive RLC circuit. We just
manage to highlight that it is equally valid in an LC circuit involving a transistor.
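The claim that |Vgs| can exceed |Vin| at resonance is easy to confirm numerically for the circuit of Figure 3.5b. The following is a rough sketch with hypothetical component values; the quality factor is computed for the equivalent series RLC circuit with R = gm L/C.

```python
import math


def vgs_over_vin(w, L, C, gm):
    """Phasor ratio Vgs/Vin for Figure 3.5b: Vgs = Iin/(jwC) and Vin = Zin*Iin."""
    z_in = 1j * w * L + 1 / (1j * w * C) + gm * L / C
    return (1 / (1j * w * C)) / z_in


# Hypothetical values; a small gm gives a small R = gm*L/C and hence a larger Q.
L, C, gm = 1e-9, 0.4e-12, 5e-3
w_c = 1 / math.sqrt(L * C)          # resonance frequency, rad/s
R = gm * L / C                      # resistive part of Zin
Q = math.sqrt(L / C) / R            # quality factor of the series RLC equivalent

ratio = vgs_over_vin(w_c, L, C, gm)
print(f"Q = {Q:.2f}, |Vgs/Vin| = {abs(ratio):.2f}")
```

With these numbers the gate-source voltage is larger than the applied input voltage by the factor Q, and it lags the input by 90°, which is the vector-sum effect described above.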


3.3.4    Similarity  Between  Q  (Quality  factor)  and  n  (Turn  Ratio) 

Now that we have explained matching in both wideband and narrowband LNAs, it is
a good opportunity to compare the two corresponding matching networks:
transformers and resonant circuits. Specifically, we will show that the turn ratio n of a
transformer plays a similar role to the quality factor Q of a resonant circuit. We can
show this by first considering the transformer. Referring to Figure 3.2b, we can
show that as n goes up, Vgs becomes larger because

$V_{gs} = nV_{in}$  (3.54)

However, the reflected Rin, defined as Rrefl, which is given by

$R_{refl} = \frac{R_{in}}{n^2}$  (3.55)

goes down. Substitute this in the following voltage divider formula:

$V_{in} = V_s \times \frac{R_{refl}}{R_{refl} + R_s}$  (3.56)

We can see that Vin becomes smaller. According to (3.54), this reduces Vgs.
Hence an increase in n affects Vgs in two opposite ways. In the end Vgs can be shown
to go to 0. The same conclusion can be shown to be true as n decreases. To see this
more clearly, substitute (3.56) into (3.54) and we have

$V_{gs} = \frac{nV_sR_{refl}}{R_{refl} + R_s}$  (3.57)

Substitute (3.55) in (3.57) and we have

$V_{gs} = \frac{nV_s\frac{R_{in}}{n^2}}{\frac{R_{in}}{n^2} + R_s} = \frac{\frac{V_sR_{in}}{n}}{\frac{R_{in}}{n^2} + R_s}$  (3.58)

As n → ∞, from (3.58) Vgs is seen to go to 0. We next rearrange (3.58) so that it
becomes

$V_{gs} = nV_s\frac{1}{1 + n^2\frac{R_s}{R_{in}}}$  (3.59)

As n → 0, from (3.59) Vgs is seen to go to 0.

The maximum Vgs is obtained for an n that is determined by setting $\frac{dV_{gs}}{dn} = 0$.
This happens when

$n = \sqrt{\frac{R_{in}}{R_s}}$  (3.60)

Substitute this into (3.55) and we can see that this happens when

$R_{refl} = R_s$  (3.61)

This is just restating that maximum power transfer (which occurs when Vgs is at
its maximum) occurs when Rrefl = Rs, or when matching occurs.
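The two opposing effects of the turn ratio can be seen by sweeping n through (3.54) to (3.56). This is a minimal sketch under assumed values: Rin = 5 kΩ, Rs = 50 Ω and Vs = 1 V are hypothetical and were picked so that the optimum turn ratio is a round number.

```python
import math


def vgs_transformer(n, v_s, r_in, r_s):
    """Vgs for a transformer match, built from (3.54), (3.55) and (3.56)."""
    r_refl = r_in / n ** 2                   # (3.55): reflected input resistance
    v_in = v_s * r_refl / (r_refl + r_s)     # (3.56): divider at the primary
    return n * v_in                          # (3.54): step-up by the turn ratio


# Hypothetical values for illustration.
v_s, r_in, r_s = 1.0, 5000.0, 50.0

n_opt = math.sqrt(r_in / r_s)                # closed-form optimum, n = sqrt(Rin/Rs)
turns = [k / 10 for k in range(1, 501)]      # sweep n from 0.1 to 50 in steps of 0.1
n_best = max(turns, key=lambda n: vgs_transformer(n, v_s, r_in, r_s))

print(f"closed-form optimum n = {n_opt:.1f}, swept optimum n = {n_best:.1f}")
print(f"Vgs at optimum = {vgs_transformer(n_best, v_s, r_in, r_s):.2f} V")
print(f"reflected resistance at optimum = {r_in / n_best ** 2:.1f} ohm")
```

Both very small and very large n drive Vgs toward zero; the peak of the sweep sits where the reflected resistance equals Rs.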

Next we consider the resonant circuit. We go to Figure 3.5b and see what
happens as Q changes. First we want to jump ahead of ourselves a bit and state a
few formulas that will be derived later. Specifically, they are (3.80a) and (3.82),
which are rewritten here in the following form:

$Q = \frac{1}{R}\sqrt{\frac{L_1 + L_2}{C}}$  (3.62)

$V_{gs} = \frac{Q}{j}V_{in}$  (3.63)

If we compare (3.54) and (3.63), we can see that both n and Q can be interpreted as
the voltage gain of the matching network (with Vin as input and Vgs as output).
Without loss of generality, let us set L1 = 0 in (3.62) and we have

$Q = \frac{1}{R}\sqrt{\frac{L_2}{C}}$  (3.64)

Substituting L2/C from (3.79b) in (3.64), we can express R as

$R = \frac{1}{Q^2g_m}$  (3.65)

When comparing (3.65) to (3.55), it can be seen that the matching network in
Figure 3.5b reflects the gm and produces a reflected input resistance R. Since we are
operating under resonance, Zin in Figure 3.5b becomes R. Therefore, we have
the following voltage divider formula:

$V_{in} = V_s\frac{R}{R + R_s}$  (3.66)

We have now developed all the necessary formulas to see the impact of changing Q.
As Q increases, from (3.63) Vgs increases. However, from (3.65) the reflected
resistance, R, goes down. According to (3.66), we see that this makes Vin smaller.
Referring to (3.63), a smaller Vin makes Vgs smaller.

Hence an increase in Q affects Vgs in two opposite ways. In the end Vgs can be
shown to go to 0. The same conclusion can be shown to be true as Q decreases.
Again, to see this more clearly substitute (3.66) in (3.63) and we have

$V_{gs} = \frac{Q}{j}\,\frac{V_sR}{R + R_s}$  (3.67)

Substitute (3.65) in (3.67), and we have

$V_{gs} = \frac{Q}{j}\,\frac{V_s\frac{1}{Q^2g_m}}{\frac{1}{Q^2g_m} + R_s}$  (3.68)

Rearranging, this becomes

$V_{gs} = \frac{1}{j}\,\frac{\frac{V_s}{Qg_m}}{\frac{1}{Q^2g_m} + R_s}$  (3.69)

As Q → ∞, from (3.69) Vgs is seen to go to 0.
We rearrange (3.69) so that it becomes

$V_{gs} = \frac{Q}{j}\,\frac{V_s}{1 + R_sQ^2g_m}$  (3.70)

As Q → 0, from (3.70) Vgs is seen to go to 0.

The maximum Vgs is obtained for a Q that is determined by setting $\frac{dV_{gs}}{dQ} = 0$. This
happens when

$Q = \frac{1}{\sqrt{R_sg_m}}$  (3.71)

Substituting this into (3.65), we can see that this happens when

$R = R_s$  (3.72)

This is just restating that maximum power transfer (which happens when Vgs is at
its maximum) occurs when R = Rs, or when matching occurs.

Fig. 3.8a    Differential implementation of a complete narrowband LNA

In summary, when we compare the two matching networks and observe the
behavior of Vgs as a function of n versus Vgs as a function of Q, we note the
similarity between n and Q.

As a final note, a transformer is a voltage amplification device (not a power
amplification device), and a resonant circuit is likewise a voltage, not a power,
amplification device. It should also be noted that the transformer is a wideband voltage
amplification device whereas the resonant circuit is a narrowband voltage
amplification device. Therefore, the transformer is used in the wideband LNA and the
resonant circuit is used in the narrowband LNA.
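The role of Q in the resonant match can be explored with the same kind of sweep used for a turn ratio. The sketch below works with magnitudes only and assumes hypothetical values (gm = 5 mS, Rs = 50 Ω, Vs = 1 V); it uses the relations R = 1/(Q² gm), the divider Vin = Vs R/(R + Rs), and |Vgs| = Q |Vin| from this sub-section.

```python
import math


def vgs_mag_resonant(q, v_s, g_m, r_s):
    """|Vgs| for the resonant match, from R = 1/(Q^2 gm) and |Vgs| = Q |Vin|."""
    r = 1 / (q ** 2 * g_m)            # resistance presented at resonance
    v_in = v_s * r / (r + r_s)        # divider between Rs and R
    return q * v_in                   # voltage gain of the network is Q


# Hypothetical values for illustration.
v_s, g_m, r_s = 1.0, 5e-3, 50.0

q_opt = 1 / math.sqrt(r_s * g_m)              # closed-form optimum
qs = [k / 100 for k in range(1, 1001)]        # sweep Q from 0.01 to 10
q_best = max(qs, key=lambda q: vgs_mag_resonant(q, v_s, g_m, r_s))

print(f"closed-form optimum Q = {q_opt:.2f}, swept optimum Q = {q_best:.2f}")
print(f"R at optimum = {1 / (q_best ** 2 * g_m):.1f} ohm (Rs = {r_s:.1f} ohm)")
```

As with the turn ratio, |Vgs| falls off on both sides of the optimum, and the optimum coincides with R = Rs.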


3.4    Narrowband  LNA:  Core  Amplifier 


Now that we have finished discussing the matching network, we can talk about the
core amplifier. In a narrowband design, because of its good high-frequency
performance, the cascode configuration is one of the typical LNA topologies for RF
applications. Using a differential cascode pair and narrowband LC tuning at the input/
output, we can further improve the circuit performance, such as noise figure and
gain. Building these features into the circuit described in Figure 3.7a, we have the
final circuit as shown in Figure 3.8a. The single-ended schematic is redrawn in
Figure 3.8b, where Vs and 50 Ω are omitted for simplicity. We now continue
our discussion by looking at design issues of this core amplifier [6].

Fig. 3.8b    Single-ended half of the complete differential narrowband LNA


3.4.1     Noise  Figure 

Let us take a look at the circuit in Figure 3.8b and derive its NF.

As shown by Friis' formula ((4.50) of Chapter 2), the NF of any cascaded
network is dominated by the first stage. Therefore, for the time being we assume
that the noise is dominated by the noise from M1. We further assume that noise from
M1 is dominated by thermal noise of the drain current. If M1 operates in a CS
configuration, it was shown in (3.23) that NF is given by

$NF = 1 + \frac{4kT\,\frac{2}{3}\,\frac{1}{g_{m1}}}{4kTR_s} = 1 + \frac{2}{3g_{m1}R_s}$  (3.73)
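As a quick numerical illustration of (3.73), the sketch below evaluates the common-source noise factor and converts it to decibels. The transconductance value is hypothetical, and the function name is an illustrative choice.

```python
import math


def nf_common_source(gm1, r_s):
    """Noise factor of a CS stage limited by drain thermal noise, per (3.73)."""
    return 1 + 2 / (3 * gm1 * r_s)


# Hypothetical device: gm1 = 20 mS driven from a 50 ohm source.
nf = nf_common_source(20e-3, 50.0)
nf_db = 10 * math.log10(nf)
print(f"noise factor = {nf:.3f}, noise figure = {nf_db:.2f} dB")
```

Because gm1 appears in the denominator, the only way to lower NF in this configuration is to raise gm1, which costs bias current.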


In the present case, M1 is source degenerated with inductor L2 and it also has an
input inductor L1. To calculate the equivalent input noise generators for the present
case, we make use of the equivalent circuit shown in Figure 3.9a. Figure 3.9a shows
the small-signal equivalent circuit of the amplifier shown in Figure 3.8b, with thermal
noise of drain current represented by the current source $\overline{i_d^2}$. The time domain variables
are shown together with their phasor representations enclosed in brackets (the noise
sources, of course, do not have phasor representations and only their mean square
values are shown). The time domain variables are in lowercase and the phasor
representations are in uppercase. Figure 3.9b represents Figure 3.9a with equivalent
noise generators $\overline{v_i^2}$, $\overline{i_i^2}$ at the input, followed by a noiseless small-signal model of
the amplifier with transconductance Gm. Here we have gm defined as the
transconductance of the device and Gm as the transconductance of the whole amplifier, which
takes into consideration the resonance effect caused by C, L. It turns out that Gm,
which is defined as the short circuit output current divided by the input voltage of this
new circuit, has to be a phasor because, from Figure 3.6b, current Io and voltage Vin
are 90° out of phase. This observation is also valid for the present circuit, although it
has an extra inductor L1. Hence Gm will be defined as the phasor representation of the
transconductance of the whole amplifier. This also explains why next to the
controlled current source, there is only the phasor representation in the bracket
(GmVx) and that there is no equivalent time domain representation (such as gmVgs),
as was found in Figure 3.9a. Gm is a key parameter that characterizes the circuit. It is
instrumental in the present case for NF calculation.

Fig. 3.9a    Small-signal equivalent circuit of LNA with noise generators

Fig. 3.9b    Representation of Figure 3.9a by two input noise generators


3.4.1.1     Derivation 

The output noise in Figure 3.9a and Figure 3.9b is now calculated with a
short-circuit load. Let us short the input of the circuit in Figure 3.9a and Figure 3.9b and
equate their resulting average output current squared, $\overline{i_o^2}$. This can be written in
phasor form as $\frac{1}{2}I_oI_o^*$ or $\frac{1}{2}|I_o|^2$. Referring to Figure 3.9a, with input shorted Vin = 0,
then $I_{in} = \frac{V_{in}}{Z_{in}} = 0$. This implies Vgs = 0. Since $\overline{i_o^2} = \frac{1}{2}|I_o|^2 = \frac{1}{2}g_m^2|V_{gs}|^2 + \overline{i_d^2}$ and with
Vgs = 0 this becomes $\overline{i_d^2}$, we have $\overline{i_o^2} = \overline{i_d^2}$. Referring now to Figure 3.9b with
Vin = 0, there are two independent sources, $\overline{v_i^2}$ and $\overline{i_i^2}$. $\overline{i_i^2}$ flows into a short circuit,
develops zero power, and can be neglected. Therefore, the only power source at the
input port comes from $\overline{v_i^2}$, the voltage noise power. Hence, average input power is
$\overline{v_i^2}$. As before, average output current squared $\overline{i_o^2}$ is $\frac{1}{2}|I_o|^2$. Now at the output port of
Figure 3.9b, it is seen that Io = GmVx and so

$\frac{1}{2}|I_o|^2 = \frac{1}{2}G_mV_x(G_mV_x)^* = \frac{1}{2}G_mV_xG_m^*V_x^* = \frac{V_xV_x^*}{2}G_mG_m^*$

(since $\frac{V_xV_x^*}{2}$ = average power at input = $\overline{v_i^2}$). Equating the average output current
squared in Figure 3.9a and Figure 3.9b, we have $\overline{i_d^2} = |G_m|^2\,\overline{v_i^2}$ or

$\overline{v_i^2} = \frac{\overline{i_d^2}}{|G_m|^2}$  (3.73a)

Notice that this is almost identical to the expression derived for $\overline{v_i^2}$ in the
wideband case, where $\overline{v_i^2} = \frac{\overline{i_d^2}}{g_m^2}$, except here $g_m^2$ is replaced by $|G_m|^2$. Assuming
that thermal noise dominates, then

$\overline{i_d^2} = 4kT\left(\frac{2}{3}g_m\right)\Delta f$  (3.73b)

Substituting (3.73b) into (3.73a), we have

$\overline{v_i^2} = \frac{4kT\left(\frac{2}{3}g_m\right)\Delta f}{|G_m|^2}$  (3.73c)


To further simplify the preceding expression, we need to express Gm in terms of
gm. Referring to Figure 3.9a again, first a voltage Vin is applied to the input (with
noise source $\overline{i_d^2}$ turned off), and the output is short-circuited. Hence an internal
voltage Vgs is developed, which, upon multiplication by gm of the device, generates
a short circuit output current Io given as

$I_o = -g_mV_{gs}$  (3.74)

Next, this same voltage Vin is applied to Figure 3.9b (with noise sources $\overline{v_i^2}$, $\overline{i_i^2}$
turned off). With output short circuited, the short circuit output current Io is given by

$I_o = -G_mV_x = -G_mV_{in}$  (3.75)

Since the two Io are identical, we equate (3.74) and (3.75), resulting in

$g_m = G_m\frac{V_{in}}{V_{gs}}$  (3.76)

Let us digress a bit and find the equation that relates $V_{in}$ to $V_{gs}$. To do this, we refer to Figure 3.9a again and rewrite Ohm's law in phasor representation for the input loop as

$$V_{in} = Z_{in}I_{in} \tag{3.77}$$

$$V_{gs} = \frac{I_{in}}{j\omega C} \tag{3.78}$$

Dividing (3.78) by (3.77), we get

$$\frac{V_{gs}}{V_{in}} = \frac{1}{Z_{in}\,j\omega C} \tag{3.79}$$

So we get $V_{gs}/V_{in}$, but to go further we need to solve for $Z_{in}$. To find $Z_{in}$, let us repeat the expression of $Z_{in}$ from (3.51):

$$Z_{in} = j\omega(L_1+L_2) + \frac{1}{j\omega C} + g_m\frac{L_2}{C} \tag{3.79a}$$

We can redraw the representation of this $Z_{in}$ as in Figure 3.10, which shows a passive RLC circuit and where we put

$$R = g_m\frac{L_2}{C} \tag{3.79b}$$

Fig. 3.10 Representation of $Z_{in}$ using a passive RLC circuit (series inductance $L_1+L_2$, capacitance $C$, resistance $g_mL_2/C$)

Now, from the matching condition, $Z_{in} = g_m\frac{L_2}{C} = R$ at $\omega_c$, and so (3.79) becomes

$$\frac{V_{gs}}{V_{in}} = \frac{1}{R\,(j\omega_cC)} \tag{3.80}$$


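Let us try to express $R$ of this resonant circuit in terms of its quality factor, $Q$, and the resonant frequency, $\omega_c$. For an RLC circuit as shown in Figure 3.10, from [3] we have

$$Q = \frac{1}{R}\sqrt{\frac{L_1+L_2}{C}} \tag{3.80a}$$

$$\omega_c = \frac{1}{\sqrt{(L_1+L_2)C}} \tag{3.80b}$$

From (3.80a) and (3.80b), we want to eliminate $L_1+L_2$ and express $Q$ in terms of $\omega_c$. The resulting expression is

$$Q = \frac{1}{R}\sqrt{\frac{L_1+L_2}{C}} = \frac{1}{R}\,\frac{1}{\omega_c\sqrt{C}}\,\frac{1}{\sqrt{C}} = \frac{1}{R\,\omega_cC} \tag{3.81}$$

Finally, we can substitute (3.81) into (3.80) and we have

$$\frac{V_{gs}}{V_{in}} = \frac{Q}{j} \tag{3.82}$$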


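We can then go back to (3.76) and solve for $G_m$ in terms of $g_m$. This is done by substituting (3.82) into (3.76), resulting in

$$G_m = g_m\frac{Q}{j} \quad\text{or}\quad |G_m|^2 = g_m^2Q^2 \tag{3.83}$$

Now, to derive $\overline{v_i^2}$, we substitute (3.83) into (3.73c) and we have

$$\overline{v_i^2} = \frac{\tfrac{2}{3}\,4kT\,g_m\,\Delta f}{g_m^2Q^2} \tag{3.84}$$

Substituting this in (3.22) with $N_{dev} = \overline{v_i^2}$ will give a NF of

$$NF = 1 + \frac{\tfrac{2}{3}\,4kT\,g_m}{g_m^2Q^2\,4kT\,R_s} = 1 + \frac{2}{3}\,\frac{1}{g_mQ^2R_s} \tag{3.85}$$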

3.4.1.2     Insight 

From (3.85) it can be seen that NF decreases as $g_m$ and $Q$ increase. Accordingly, to obtain the best NF in the present case, $Q$ should be set to $\infty$. However, from (3.81) $Q = \frac{1}{R\,\omega_cC}$. For a given $\omega_c$, $C$, setting $Q$ to $\infty$ means setting $R = 0$. On the other hand, the preceding expression is defined under the matching condition for maximum power transfer, which means that this $R$ must be made equal to $R_s$, so $R_s$ would have to become 0. In general, $R_s$ is fixed and so the best NF condition cannot be achieved. Hence we can see that the matching conditions for optimal noise performance and optimal power performance are not identical.

To gain more insight let us re-express the NF expression in (3.85) in an alternate form, concentrating on the $g_mQ^2R_s$ factor in the denominator. First, the matching of real parts dictates $R_s = g_m\frac{L_2}{C}$. Also, $Q = \frac{1}{R\,\omega_cC}$, where $R = g_m\frac{L_2}{C}$. Upon substitution, the $g_mQ^2R_s$ factor becomes

$$g_mQ^2R_s = g_m\left(\frac{C}{g_mL_2\,\omega_cC}\right)^2 g_m\frac{L_2}{C} = \frac{1}{L_2\,\omega_c^2C} \tag{3.86}$$

Next, we make use of the imaginary matching condition $\frac{1}{\omega_c^2C} = L_1 + L_2$, and (3.86) becomes

$$g_mQ^2R_s = \frac{1}{L_2}(L_1+L_2) \tag{3.87}$$

Now we substitute (3.87) in (3.85) to get

$$NF = 1 + \frac{2}{3}\,\frac{L_2}{L_1+L_2} = 1 + \frac{2}{3}\,\frac{1}{1+\frac{L_1}{L_2}} \approx 1 + \frac{2L_2}{3L_1} \tag{3.88}$$

The last approximation will be true if $L_1 \gg L_2$. Notice that we finally express NF in terms of only passive circuit components $L_1$, $L_2$. From (3.88), NF goes down as $L_1$ increases and $L_2$ decreases. This provides valuable design guidelines.
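The dependence in (3.88) can be checked numerically. The following is a minimal sketch, assuming the exact form NF = 1 + (2/3)·L2/(L1 + L2); the function names and the inductor values in the loop are illustrative choices, not values from the text.

```python
import math


def noise_factor(l1, l2):
    """Noise factor (linear) from (3.88): F = 1 + (2/3) * L2 / (L1 + L2)."""
    return 1.0 + (2.0 / 3.0) * l2 / (l1 + l2)


def noise_figure_db(l1, l2):
    """Same quantity expressed in dB."""
    return 10.0 * math.log10(noise_factor(l1, l2))


if __name__ == "__main__":
    l2 = 2e-9  # illustrative source-degeneration inductance (assumption)
    for l1 in (0.5e-9, 2e-9, 8e-9, 20e-9):
        print(f"L1 = {l1 * 1e9:5.1f} nH -> NF = {noise_figure_db(l1, l2):.2f} dB")
```

Sweeping L1 upward for a fixed L2 shows the noise figure falling toward 0 dB, which is the design guideline stated above.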
3.4.2    Power Dissipation

In this sub-section we derive the dependence of power dissipation on technology and circuit parameters under matching condition and for a given $V_{GS} - V_T$ (which is usually fixed for a design). First,

$$P = I_DV_{dd} \tag{3.89}$$

where $I_D$ is the drain current of $M_1$ [shown in Figure 3.8b]. Therefore

$$P \propto I_D \tag{3.90}$$

Since $M_1$ operates in the saturation region, we have the following $I_D$ equation:

$$I_D = \frac{1}{2}k\frac{W}{L}(V_{GS}-V_T)^2 \tag{3.91}$$

where $k = \mu C_{ox}$; that is, $I_D \propto \mu C_{ox}\frac{W}{L}$ for a given $V_{GS} - V_T$ (fixed for a design, as stated at the beginning of this sub-section). Substituting (3.91) into (3.90), we have

$$P \propto \mu C_{ox}\frac{W}{L} \tag{3.92}$$

Next, let us try to express $\mu C_{ox}\frac{W}{L}$ in terms of $g_m$, $C$. We will show that $\mu C_{ox}\frac{W}{L}$ and $\frac{g_m^2}{C}\frac{L^2}{\mu}$ are proportional to one another. To show this, it is easier to start from the opposite end; that is, from the expression $\frac{g_m^2}{C}\frac{L^2}{\mu}$. We first note that $C = C_{gs}$ of $M_1$ and hence

$$C = \frac{2}{3}WLC_{ox} \tag{3.93}$$

Also, for $M_1$ its $g_m$ can be expressed as

$$g_m = k\frac{W}{L}(V_{GS}-V_T);\quad \text{i.e. } g_m \propto \mu C_{ox}\frac{W}{L} \text{ for a given } (V_{GS}-V_T) \tag{3.94}$$

Substituting (3.93) and (3.94) into the expression $\frac{g_m^2}{C}\frac{L^2}{\mu}$, we have

$$\frac{g_m^2}{C}\frac{L^2}{\mu} \propto \frac{\left(\mu C_{ox}\frac{W}{L}\right)^2}{WLC_{ox}}\,\frac{L^2}{\mu} = \frac{\mu^2C_{ox}^2W^2}{WLC_{ox}\,\mu} = \mu C_{ox}\frac{W}{L} \tag{3.95}$$

Reversing the proportionality operator in (3.95), we have

$$\mu C_{ox}\frac{W}{L} \propto \frac{g_m^2}{C}\frac{L^2}{\mu} \tag{3.96}$$


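Substituting (3.96) into (3.92), we get

$$P \propto \frac{g_m^2}{C}\frac{L^2}{\mu} \tag{3.97}$$

Finally, we want to express $g_m$, $C$ in (3.97) in terms of more basic parameters. Under matching,

$$\omega_c(L_1+L_2) = \frac{1}{\omega_cC} \quad\text{which implies}\quad C = \frac{1}{\omega_c^2(L_1+L_2)} \tag{3.98}$$

Also,

$$R_s = \frac{g_m}{C}L_2 \quad\text{or}\quad g_m = \frac{R_sC}{L_2} \tag{3.99}$$

First, substituting (3.99) into (3.97), we get

$$P \propto \left(\frac{R_sC}{L_2}\right)^2\frac{1}{C}\,\frac{L^2}{\mu} = \frac{L^2}{\mu}\,\frac{R_s^2}{L_2^2}\,C \tag{3.100}$$

Then, substituting (3.98) into (3.100), we have

$$P \propto \frac{L^2}{\mu}\,\frac{R_s^2}{L_2^2}\,\frac{1}{\omega_c^2(L_1+L_2)} \tag{3.101}$$

Now we can regroup terms after the proportionality sign in (3.101) and rewrite (3.101) in the following form:

$$P \propto \underbrace{\frac{L^2}{\mu}}_{\text{technology}}\;\underbrace{\frac{R_s^2}{\omega_c^2}}_{\text{standard}}\;\underbrace{\frac{1}{L_2^2(L_1+L_2)}}_{\text{circuit parameter}} \tag{3.102}$$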

As in the NF case, we finally break down clearly the dependence of $P$ in terms of various groups of parameters: technology, standard, and circuit parameters. Examining (3.102), the first term, which includes the minimum feature size $L$ and the mobility $\mu$, is fixed for a given technology. As $L$ goes down (scaled technology), $P$ goes down as well. This is satisfying because it means that as technology scales, power goes down. With a given standard, the parameters in the second term, $R_s$ and $\omega_c$, are fixed. For example, with the DECT standard, $S_{11}$ is given. From (3.2a) this fixes $R_{in}$, which through matching conditions in turn fixes $R_s$. In the DECT standard $\omega_c$ is also fixed and is set to $2\pi \times 1.9$ GHz. Hence, the only degrees of freedom are $L_1$, $L_2$, the circuit parameters as grouped in the third term. From (3.102) we have the following conclusion: $P$ goes down as $L_1$ goes up.

To summarize, from (3.88) and (3.102) we can conclude that both NF and $P$ go down as $L_1$ goes up.
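The grouping in (3.102) lends itself to a quick numerical comparison. The sketch below evaluates only the right-hand side of the proportionality, so it is useful for ratios between design points and not for absolute power; the function name, the mobility value, and the inductor values are assumptions made for illustration.

```python
import math


def relative_power(l_min, mobility, r_s, omega_c, l1, l2):
    """Right-hand side of (3.102); proportional to P, not equal to it."""
    technology = l_min ** 2 / mobility          # fixed by the process
    standard = r_s ** 2 / omega_c ** 2          # fixed by the standard
    circuit = 1.0 / (l2 ** 2 * (l1 + l2))       # the designer's freedom
    return technology * standard * circuit


if __name__ == "__main__":
    omega_c = 2.0 * math.pi * 1.9e9
    common = dict(l_min=0.5e-6, mobility=0.04, r_s=50.0, omega_c=omega_c, l2=2e-9)
    p_small = relative_power(l1=0.5e-9, **common)
    p_large = relative_power(l1=2e-9, **common)
    print(f"P(L1 = 0.5 nH) / P(L1 = 2 nH) = {p_small / p_large:.2f}")
```

Only the ratio is meaningful here: raising L1 from 0.5 nH to 2 nH with L2 fixed lowers the power metric by the factor (L1 + L2) ratio, consistent with the conclusion above.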


3.4.3     Trade-Off  Between  Noise  Figure  and  Power 

In the last two sub-sections, we derived the dependence of the two most important parameters, NF and power, of a narrowband LNA on the circuit parameters. In this sub-section we compare this dependence to the wideband case as discussed previously. This will uncover some fundamental differences between the two approaches and further show why a narrowband design enjoys some inherent advantages in achieving a better NF, power trade-off than the wideband case. This may help justify its use, in view of its need to have an inductor, which is not trivial to realize in silicon.

The key observation we make in the comparison is that in the narrowband case, the average input referred voltage noise power $\overline{v_i^2} \propto g_m$, the transconductance of the device, whereas in the wideband case, $\overline{v_i^2} \propto \frac{1}{g_m}$. To show this, let us reformulate the expression for $G_m$ in (3.83) in a different form that highlights this dependence. We start off again with (3.83):

$$G_m = g_m\frac{Q}{j} \tag{3.103}$$

From (3.81) we have, under matching conditions (i.e., $R = R_s$),

$$Q = \frac{1}{R_s\,\omega_cC} \tag{3.104}$$

Substituting (3.104) into (3.103), we get

$$G_m = g_m\,\frac{1}{R_s\,\omega_cC}\,\frac{1}{j} \tag{3.105}$$

Now we want to rewrite $G_m$ in terms of inductors $L_1$, $L_2$. This can be done because under matching conditions $R_s$ can be expressed as $\frac{g_m}{C}L_2$, and if this is substituted in (3.105) we can simplify the expression as

$$G_m = g_m\,\frac{C}{g_mL_2\,\omega_cC}\,\frac{1}{j} = \frac{1}{j\omega_cL_2} \tag{3.106}$$

Table 3.2 Comparison of wideband and narrowband LNA

              v_i^2 ∝     P ∝
Wideband      1/g_m       g_m
Narrowband    g_m         g_m

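Now substitute (3.106) into (3.73c):

$$\overline{v_i^2} = \frac{\tfrac{2}{3}(4kT)\,g_m\,\Delta f}{\left|\dfrac{1}{j\omega_cL_2}\right|^2} \tag{3.107}$$

Notice that once $\omega_c$, $L_2$ are fixed from the resonance condition, the denominator of (3.107) is fixed and is independent of $g_m$. Hence, from (3.107),

$$\overline{v_i^2} \propto g_m \tag{3.108}$$

Now compare this with the wideband case. From (3.23) we have

$$\overline{v_i^2} = \frac{2}{3}\,\frac{4kT\,\Delta f}{g_m} \tag{3.109}$$

or

$$\overline{v_i^2} \propto \frac{1}{g_m} \tag{3.110}$$

Comparing (3.110) and (3.108), $\overline{v_i^2}$ (and hence NF) behaves in an opposite way in its dependence on $g_m$ for the narrowband versus wideband cases.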

We  summarize  in  Table  3.2. 

Therefore, for the wideband case, if we want less noise (i.e., smaller $\overline{v_i^2}$) we pay a price by dissipating more power $P$. On the other hand, in the narrowband case, power actually goes down as $\overline{v_i^2}$ goes down and the trade-off between NF and power is broken. The underlying reason is that in using the inductor we tune out $C$ (so that it matches $R_s$), so the gain $G_m$ in (3.106) is independent of $g_m$ and therefore the input referred noise is proportional to $g_m$.
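The opposite dependence on the transconductance can be made concrete with a short numerical sketch. It assumes the thermal-noise forms (3.107) and (3.109) exactly as reconstructed here, a temperature of 300 K, and a 1 Hz bandwidth; the function names and the sample values are illustrative.

```python
import math

K_BOLTZMANN = 1.380649e-23  # J/K


def vi2_wideband(g_m, temp=300.0, delta_f=1.0):
    """Input-referred noise power, wideband case (3.109): (2/3) 4kT df / gm."""
    return (2.0 / 3.0) * 4.0 * K_BOLTZMANN * temp * delta_f / g_m


def vi2_narrowband(g_m, omega_c, l2, temp=300.0, delta_f=1.0):
    """Narrowband case (3.107) with |Gm| = 1 / (omega_c * L2)."""
    gm_big_sq = 1.0 / (omega_c * l2) ** 2
    return (2.0 / 3.0) * 4.0 * K_BOLTZMANN * temp * g_m * delta_f / gm_big_sq


if __name__ == "__main__":
    omega_c = 2.0 * math.pi * 1.9e9
    for g_m in (0.035, 0.070):
        print(g_m, vi2_wideband(g_m), vi2_narrowband(g_m, omega_c, 2e-9))
```

Doubling the transconductance halves the wideband noise power and doubles the narrowband one, which is the contrast summarized in Table 3.2.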


3.4.4    Noise  Contribution  from  other  Sources 


We have assumed up to now that the dominant noise source is the thermal noise from $M_1$. It turns out that the passive devices can also contribute noise. Inductor $L_1$ is usually a bond wire inductor, which has a high Q factor. Accordingly, its noise contribution can be neglected. Inductors $L_2$ and $L_3$ are usually on-chip spiral inductors, which are lossy and have a low Q factor. By design, the circuit needs a very small $L_2$, so the accompanying parasitic resistance and its noise contribution can also be ignored. Hence, the only significant noise source from the passive devices may be from $L_3$.

Fig. 3.11a Representation of $L_3$ with parasitics using an equivalent network

Fig. 3.11b Approximation to Figure 3.11a when $Q_{ind3} \gg 1$ (with $R_{load} = Q_{ind3}^2R_p$)

From the Friis' formula [(4.50) in Chapter 2], we know that if the gain of $M_1$ is large, the source resistance and $M_1$'s thermal noise are the dominant noise sources, unless the thermal noise of $M_2$ and $L_3$ is much larger than that of $M_1$. We ignore noise from $M_2$ for the time being. Turning to $L_3$, we begin by referring to Figure 3.8b, where we would like to find out the noise contribution from the noise sources due to the loss in the output inductor $L_3$. We will show that the input referred equivalent noise resistance $R_{eq}$ of $L_3$ equals $\frac{1}{|G_m|^2Q_{ind3}^2R_p}$, where $Q_{ind3} = \frac{\omega_cL_3}{R_p}$ is the quality factor of the inductor $L_3$. This should not be confused with $Q$, the quality factor of the input impedance $Z_{in}$. $R_p$ is the parasitic resistance associated with $L_3$ due to substrate loss.

First we recognize that inductor $L_3$ is implemented as a spiral inductor on the silicon surface. Due to substrate loss, the ideal $L_3$ is replaced by its equivalent circuit, as shown in Figure 3.11a, where $R_p$, $C_p$ are the parasitic resistance and capacitance associated with $L_3$. If $Q_{ind3} \gg 1$, Figure 3.11a becomes Figure 3.11b, where $R_p$ is replaced by $R_{load} = Q_{ind3}^2R_p$. This can be derived easily by equating the input impedance seen in Figure 3.11a and Figure 3.11b while keeping in mind that $Q_{ind3} \gg 1$ (for small $R_p$). The impedance in Figure 3.11a is

$$Z_{in}(s) = \frac{sL_3+R_p}{s^2L_3C_p+sR_pC_p+1} \approx \frac{sL_3}{s^2L_3C_p+sR_pC_p+1} \quad (\text{for } Q_{ind3} \gg 1) \tag{3.110a}$$

The impedance in Figure 3.11b is

$$Z_{in}(s) = \frac{sL_3}{s^2L_3C_p+sL_3/R_{load}+1} \tag{3.110b}$$

In order to get the same impedance (in a steady state), we equate $Z_{in}$ in (3.110a) to $Z_{in}$ in (3.110b). We then observe that $L_3/R_{load} = R_pC_p$ at $\omega_c$. Substituting $Q_{ind3} = \frac{1}{R_p}\sqrt{\frac{L_3}{C_p}}$ in this expression, it follows that the equivalent load resistance is given by

$$R_{load} = Q_{ind3}^2R_p \tag{3.111}$$

Fig. 3.12a Narrowband LNA redrawn, separating the transconductor from the load


Fig. 3.12b Small-signal equivalent circuit of Figure 3.12a (load: $C_p + C_1$, $L_3$, $R_{load} = Q_{ind3}^2R_p$)

Fig. 3.13a Small-signal equivalent circuit of Figure 3.12a with noise generators

Fig. 3.13b Representation of Figure 3.13a by two input noise generators

Let us next redraw Figure 3.8b in Figure 3.12a. Figure 3.12a serves to highlight the noise contribution from the parasitic resistance of $L_3$ by separating the amplifier into the load part and the transconductor part. Then $L_3$ in the load part is replaced by its equivalent model, as shown in Figure 3.11b. Similarly, the transconductor part is replaced by its small-signal equivalent model. Combining, we have the resulting circuit as shown in Figure 3.12b.

Since we are now interested only in noise from resistor $R_p$, Figure 3.12b is next redrawn in Figure 3.13a, where the noise current source that denotes the thermal noise from $R_p$ is explicitly shown. Following the equivalent representation of Figure 3.9a by Figure 3.9b, Figure 3.13a is now equivalently represented by Figure 3.13b, where all the noise sources have been absorbed in the two equivalent input noise sources $\overline{v_i^2}$, $\overline{i_i^2}$. To find $\overline{v_i^2}$, $\overline{i_i^2}$, we short the inputs and outputs in Figure 3.13a and Figure 3.13b, find the short-circuited average output current squared $\overline{i_o^2}$, and equate them.

From Figure 3.13a,

$$\overline{i_o^2} = \frac{4kT\,\Delta f}{R_{load}} \tag{3.112}$$

From Figure 3.13b,

$$\overline{i_o^2} = |G_m|^2\,\overline{v_i^2} \tag{3.113}$$

Equating $\overline{i_o^2}$ in both (3.112) and (3.113), we have

$$|G_m|^2\,\overline{v_i^2} = \frac{4kT\,\Delta f}{R_{load}} \quad\text{or}\quad \overline{v_i^2} = \frac{4kT\,\Delta f}{|G_m|^2R_{load}} \tag{3.114}$$

Now we can substitute for $R_{load}$ from (3.111) into (3.114) to get the equivalent input referred noise as

$$\overline{v_i^2} = \frac{4kT\,\Delta f}{|G_m|^2Q_{ind3}^2R_p} \tag{3.115}$$

Comparing this with noise from $M_1$ in (3.73c), it is seen that noise from $L_3$, when input referred, gets an extra attenuation due to $Q_{ind3}$ and so normally the noise from $M_1$ still dominates, unless the substrate loss is substantial (meaning that $Q_{ind3}^2R_p$ is small).

We can also find the noise-equivalent resistance $R_{eq}$ of the lossy inductor $L_3$. We apply the definition of $R_{eq}$ to (3.115) and we have

$$R_{eq} = \frac{1}{|G_m|^2Q_{ind3}^2R_p} \tag{3.116}$$

From (3.116) we observe that $R_{eq} \propto \frac{1}{Q_{ind3}^2R_p}$. Substituting (3.111) in this expression means that $R_{eq} \propto \frac{1}{R_{load}}$. So in order to minimize noise, we maximize $R_{load}$ or minimize substrate loss.
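A small numerical sketch can show the size of the quantities in (3.111) and (3.116). It assumes $Q_{ind3} = \omega_c L_3 / R_p$ and $|G_m| = 1/(\omega_c L_2)$ from (3.106); the 5 Ω parasitic resistance and the inductor values are hypothetical, chosen only to illustrate the calculation.

```python
import math


def q_inductor(omega_c, l3, r_p):
    """Quality factor of the lossy load inductor, Q = omega_c * L3 / Rp."""
    return omega_c * l3 / r_p


def r_load(q_ind3, r_p):
    """Equivalent parallel load resistance from (3.111)."""
    return q_ind3 ** 2 * r_p


def r_eq(gm_big, q_ind3, r_p):
    """Input-referred noise-equivalent resistance from (3.116)."""
    return 1.0 / (gm_big ** 2 * q_ind3 ** 2 * r_p)


if __name__ == "__main__":
    omega_c = 2.0 * math.pi * 1.9e9
    l2, l3, r_p = 2e-9, 2e-9, 5.0       # hypothetical values
    q = q_inductor(omega_c, l3, r_p)
    gm_big = 1.0 / (omega_c * l2)       # magnitude of (3.106)
    print(f"Q_ind3 = {q:.2f}, R_load = {r_load(q, r_p):.1f} ohm, "
          f"R_eq = {r_eq(gm_big, q, r_p):.2f} ohm")
```

Under these assumptions the expressions collapse to $R_{eq} = R_p (L_2/L_3)^2$, so a larger $L_3$ relative to $L_2$ pushes the inductor's input-referred noise down.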


3.4.5    Gain 

Let us go back to the circuit in Figure 3.12a. Assume that initially there is no $L_3$. At high frequency, since we have a capacitive load $C_1$, it seems that $C_1$ is going to shunt the output to ground and will reduce gain. By using $L_3$, we show that we can still get very large voltage gain. The gain is $A_v = G_m\left(\frac{1}{j\omega C_1} \,\|\, j\omega L_3\right)$. Here $G_m$ is the transconductance defined in Figure 3.12b. If we substitute $G_m$ from (3.106), then

$$A_v = \left(\frac{1}{j\omega L_2}\right)\frac{j\omega L_3}{1-\omega^2L_3C_1} = \frac{L_3}{L_2}\,\frac{1}{1-\omega^2L_3C_1} \tag{3.117}$$

Fig. 3.14a Small-signal equivalent circuit of a MOS transistor including gate resistance due to high-frequency effect

Fig. 3.14b Plot of input admittance of a MOS transistor versus frequency

From (3.117) it can be seen that the output LC tuning circuit peaks the gain at $\omega$ close to $\frac{1}{\sqrt{L_3C_1}}$. In reality, we cannot achieve this gain because we have neglected the loss (parasitic resistance) of $L_3$.

3.4.6    Other  Real-Life  Design  Considerations 


We have now finished deriving essential design equations governing matching, noise, power, and gain of the circuits. In this sub-section we discuss the more important features of the models for the active and passive components. They have some impact on our design procedure.

First, at high frequency the gate of an MOS transistor starts to have a resistive component, as shown in Figure 3.14a. This is due to distributed gate resistance, and the $C_{gs}$ is replaced by an RC ladder network. This resistance $R_g$ will change the matching criteria ($R = R_g + g_m\frac{L_2}{C}$). The distributed gate resistance comes about because the signal at one end of the gate has not quite traveled to the other end and the gate potential is in a non-quasistatic state. Hence, as shown in Figure 3.14b, the plot of input admittance $Y_{in}$ versus frequency $f$ is no longer a straight line (purely capacitive). Rather it bends and flattens out, indicating a resistive component. Notice that this $R_g$ is not a real physical resistor and therefore does not contribute to noise.

Fig. 3.15a Physical construction of a bond wire inductor

Fig. 3.15b Physical construction of a spiral inductor and its complete equivalent model
Second, we briefly discuss the two types of on-chip inductors available. First there is the bond wire inductor, as shown in Figure 3.15a, where the inductance value is a function of the pad distance $d$ and bond angle $\beta$. Typical values in packages range from 2 nH to 5 nH, with about 1 nH/mm. The series loss is about 0.2 Ω/mm for a 1 mil diameter Al. The Q of this inductor is about 60 at 2 GHz. Even though it has a high Q, one major disadvantage is the reproducibility and predictability of this type of inductor. The second type of inductor is the spiral inductor on lossy substrate, as shown in Figure 3.15b. Also shown is its complete equivalent model [compare that with the simple equivalent model shown in Figure 3.11a]. For this type of inductor, the key design parameters are as follows:

•    Inductance: $L = \sum (L_{self} + L_{mutual})$, where $L_{self}$, $L_{mutual}$ are the self and mutual inductance

•    Quality factor: $Q = \frac{\omega L}{R}$

•    Self-resonant frequency: $f_{self} = \frac{1}{2\pi\sqrt{LC}}$


The existence of $f_{self}$ may come as a surprise, but it comes from the fact that there is capacitance associated with this inductor. Accordingly, the inductor can self-resonate, rendering it useless. This capacitance exists here because there is significant parasitic capacitance between the metal deposited on top of the silicon surface and the substrate. In general, we have to trade off $Q$ with $f_{self}$ since as $L$ goes up, $Q$ goes up but $f_{self}$ goes down. Typical characteristics for this inductor include $L$ = 1 nH to 8 nH and $Q$ = 3 to 6 at 2 GHz.

For inductor layout there are two points to remember: (1) set the $f_{self}$ so it is far away from $\omega_c$; and (2) you may add a load capacitance $C_{load}$ to the output of the equivalent model, which is shown in Figure 3.15b. You can see that if this capacitor has a high Q, it can be used to tune out the inductor, so that it has very high impedance (leading to high gain) at $\omega_c$.
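The two inductor figures of merit quoted above are easy to evaluate. The sketch below assumes the standard series-loss definition $Q = \omega L / R$ and $f_{self} = 1/(2\pi\sqrt{LC})$; the bond wire numbers (1 nH and 0.2 Ω per millimetre) come from the text, while the spiral inductance and parasitic capacitance are hypothetical.

```python
import math


def inductor_q(freq_hz, inductance, series_resistance):
    """Quality factor Q = omega * L / R for a series-loss inductor model."""
    return 2.0 * math.pi * freq_hz * inductance / series_resistance


def self_resonant_frequency(inductance, parasitic_capacitance):
    """f_self = 1 / (2 * pi * sqrt(L * C))."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance * parasitic_capacitance))


if __name__ == "__main__":
    # One millimetre of bond wire: about 1 nH and 0.2 ohm (values from the text).
    print(f"bond wire Q at 2 GHz: {inductor_q(2e9, 1e-9, 0.2):.0f}")
    # Hypothetical spiral: 5 nH with 0.2 pF of parasitic capacitance.
    f_self = self_resonant_frequency(5e-9, 0.2e-12)
    print(f"spiral f_self: {f_self / 1e9:.2f} GHz")
```

The bond wire result lands close to the Q of about 60 quoted above, and the spiral example shows how quickly $f_{self}$ approaches the operating band as L or the parasitic capacitance grows.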

Design  Example  3.2. 

To  illustrate  the  salient  points  of  a  narrowband  LNA  design,  let  us  go  through  a 
design  example.  The  topology  in  Figure  3.8b  is  assumed,  where  Vdd  is  taken  to  be 
3.3  V.  [Note  that  if  we  use  the  differential  circuit  in  Figure  3.8a,  it  will  conduct  twice 
the  current  and  dissipate  twice  the  power.  In  order  to  compare  fairly  with  the 
wideband  design  in  Design  Example  3.1,  which  is  single  ended,  a  single  ended 
implementation  should  be  used.]  The  same  specifications  as  used  in  Design  Example 
3.1 (namely, Table 3.1) are used here. This time we decide to use a 0.5 µm CMOS technology. As in the wideband case, we decide to select a matching network that optimizes power delivery and maintains stability at the expense of noise performance.

Step 1: Designing $L_2$ using the real part matching condition
We assume $R_s = 50\ \Omega$. The matching network due to $L_1$, $L_2$, $M_1$ transforms $R_{in}$ due to $M_1$ alone to $R_{in\_power}$, which is given by (3.51) to be $g_m\frac{L_2}{C}$. To achieve matching, we set $R_{in\_power} = R_s$ and we have

$$50\ \Omega = g_m\frac{L_2}{C} = \frac{g_m}{C}L_2 = 2\pi f_TL_2 \tag{3.118}$$

where $f_T$ is the unity gain frequency of the MOS transistor and is given by

$$2\pi f_T = \frac{g_m}{C} \tag{3.119}$$

Therefore,

$$L_2 = \frac{50\ \Omega}{2\pi f_T} \tag{3.120}$$

For our 0.5 µm technology we are given $f_T = 4$ GHz. Hence:

$$L_2 = \frac{50\ \Omega}{2\pi \times 4\ \text{GHz}} = 2\ \text{nH} \tag{3.121}$$


Step 2: Designing $L_1$ using NF specification

Let us now apply the second specification: NF < 3 dB. To make use of this specification, let us invoke (3.88). Then we have

$$NF = 1 + \frac{2}{3}\,\frac{1}{1+\frac{L_1}{L_2}} < 2 \tag{3.122}$$

Solving, 

^<\  (3.123) 

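Substituting (3.121) from step 1, we have

$$L_1 < 0.67\ \text{nH} \tag{3.123a}$$

We arbitrarily make

$$L_1 = 0.5\ \text{nH} \tag{3.123b}$$

With $L_1 = 0.5$ nH and $L_2 = 2$ nH, (3.88) evaluates to NF = 1 + (2/3)(2/2.5) ≈ 1.53, or about 1.9 dB, which meets the 3 dB specification.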

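Step 3: Designing $C = C_{gs}$ and hence $\left(\frac{W}{L}\right)_1$, using the imaginary part matching condition

Using the matching condition from (3.52), we have

$$\omega_c(L_1+L_2) = \frac{1}{\omega_cC} \tag{3.124}$$

With $\omega_c = 2\pi \times 1.9$ Grad/s, and $L_1$, $L_2$ determined from (3.121) and (3.123b), we have

$$C = \frac{1}{\omega_c^2(L_1+L_2)} = \frac{1}{(2\pi \times 1.9\ \text{Grad/s})^2(0.5\ \text{nH} + 2\ \text{nH})} = 2.8\ \text{pF} \tag{3.125}$$

Now we can determine $\left(\frac{W}{L}\right)_1$. First we have

$$C = C_{gs} = \frac{2}{3}WLC_{ox} \tag{3.126}$$

Secondly, we want $L$ to be minimum. Hence we set $L = 0.5$ µm. For this process $C_{ox}$ is given to be 2.6 fF/µm².

Substituting (3.125) and the values of $C_{ox}$ and $L$ in (3.126), we have

$$W = \frac{C}{\frac{2}{3}C_{ox}L} = \frac{2.8\ \text{pF}}{1.75\ \text{fF}/\mu\text{m}^2 \times 0.5\ \mu\text{m}} = 3200\ \mu\text{m} \tag{3.127}$$

Therefore,

$$\left(\frac{W}{L}\right)_1 = \frac{3200\ \mu\text{m}}{0.5\ \mu\text{m}} = 6400 \tag{3.128}$$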

Step 4: Checking to see if the power specification is met

First, we find $g_m$ from $C$. To do so, we re-arrange (3.119) and substitute (3.125) in it and we have

$$g_m = 2\pi f_T \times C = 2\pi \times 4\ \text{GHz} \times 2.8\ \text{pF} = 70\ \text{m}\Omega^{-1} \tag{3.129}$$

Next, we find the drain current of $M_1$ and hence power from the following equation:

$$g_m = \sqrt{2k'\left(\frac{W}{L}\right)_1 I_{D1}} \tag{3.130}$$

For this 0.5 µm process we are given $k' = 100\ \mu\text{A/V}^2$. Re-arranging (3.130) and substituting (3.129) and (3.128) in it, we have

$$I_{D1} = \frac{g_m^2}{2k'\left(\frac{W}{L}\right)_1} = \frac{(70\ \text{m}\Omega^{-1})^2}{2 \times 100\ \mu\text{A/V}^2 \times \frac{3200\ \mu\text{m}}{0.5\ \mu\text{m}}} = 3.8\ \text{mA} \tag{3.131}$$

Hence

$$P = 3.8\ \text{mA} \times 3.3\ \text{V} = 12\ \text{mW} \tag{3.132}$$

This is just the power in the amplifier core.

If we assume that we have a similar bias and output buffer arrangement as in the wideband case, which is depicted in Figure 3.3a, then following similar arguments that led to (3.27a), we can assume

$$P_{total} = 4P_{M_1} = 48\ \text{mW} \tag{3.133}$$

This slightly exceeds the specification.
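Steps 1 to 4 chain together simple closed-form expressions, so they can be reproduced in a short script. The sketch below uses the example's stated inputs and assumes $k' = 100\ \mu\text{A/V}^2$; the function name `design_core` and its argument names are hypothetical. Because it carries unrounded intermediate values, its outputs differ slightly from the rounded figures in the text.

```python
import math


def design_core(f_t=4e9, f_c=1.9e9, r_s=50.0, l1=0.5e-9,
                c_ox=2.6e-15, l_ch=0.5, k_prime=100e-6, v_dd=3.3):
    """Steps 1-4 of the example; c_ox is in F/um^2 and l_ch in um."""
    omega_c = 2.0 * math.pi * f_c
    l2 = r_s / (2.0 * math.pi * f_t)                 # (3.120)
    c = 1.0 / (omega_c ** 2 * (l1 + l2))             # (3.125)
    w = c / ((2.0 / 3.0) * c_ox * l_ch)              # (3.127), in um
    g_m = 2.0 * math.pi * f_t * c                    # (3.129)
    i_d = g_m ** 2 / (2.0 * k_prime * (w / l_ch))    # (3.131)
    p_core = i_d * v_dd                              # (3.132)
    return {"L2": l2, "C": c, "W_um": w, "gm": g_m, "ID": i_d, "P": p_core}


if __name__ == "__main__":
    for name, value in design_core().items():
        print(f"{name:5s} = {value:.4g}")
```

Changing a single input, such as `f_t` for a faster process, shows how the inductor, device width, and core power move together under the matching constraints.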

Step 5: Using the gain equation and gain specification to find capacitance $C_1$
From Table 3.1, gain must be larger than 20 dB or 10. Substituting in (3.117), we have

$$\frac{L_3}{L_2(1-\omega_c^2L_3C_1)} > 10 \tag{3.133a}$$

For simplicity, let us arbitrarily pick $L_3 = L_2$, which equals 2 nH [from (3.121)]. Hence

$$L_3 = 2\ \text{nH} \tag{3.134}$$

With this value the input referred noise from $L_3$'s parasitic resistance $R_p$ will be very small and, as discussed in sub-section 3.4.4, will not affect our NF.

Substituting (3.121) and (3.134) in (3.133a), we have

$$1-(2\pi f_c)^2(2\ \text{nH})\,C_1 < \frac{1}{10} \tag{3.135}$$

Rearranging,

$$C_1 > \frac{0.9}{(2\pi f_c)^2 \times 2\ \text{nH}} = \frac{0.9}{(2\pi \times 1.9\ \text{GHz})^2 \times 2\ \text{nH}} \approx 3.2\ \text{pF} \tag{3.136}$$

We arbitrarily set

$$C_1 = 4\ \text{pF} \tag{3.137}$$

Finally, for simplicity, we set

$$\left(\frac{W}{L}\right)_2 = \left(\frac{W}{L}\right)_1 \tag{3.138}$$

Step 6: Checking that $S_{11}$ is satisfied

In step 1 we transformed $R_{in}$ so that $R_{in\_power} = 50\ \Omega$. Hence, assuming a $Z_0$ of 50 Ω, from (3.2c) $S_{11} = 0$ and the specification on $S_{11}$ is satisfied.

At  this  point  we  have  finished  our  design  on  the  narrowband  LNA. 
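The matching claim in step 6 can be checked numerically from the input impedance expression (3.79a). The sketch below assumes the usual reflection coefficient definition $S_{11} = (Z_{in} - Z_0)/(Z_{in} + Z_0)$, which may differ in form from the text's (3.2c), and plugs in the rounded component values of the example.

```python
import math


def input_impedance(omega, l1, l2, c, g_m):
    """Z_in from (3.79a): j*w*(L1 + L2) + 1/(j*w*C) + gm * L2 / C."""
    return 1j * omega * (l1 + l2) + 1.0 / (1j * omega * c) + g_m * l2 / c


def s11(z_in, z0=50.0):
    """Reflection coefficient relative to a reference impedance z0."""
    return (z_in - z0) / (z_in + z0)


if __name__ == "__main__":
    omega_c = 2.0 * math.pi * 1.9e9
    z = input_impedance(omega_c, 0.5e-9, 2e-9, 2.8e-12, 70e-3)
    print(f"Z_in  = {z.real:.2f} {z.imag:+.3f}j ohm")
    print(f"|S11| = {abs(s11(z)):.5f}")
```

With the rounded values the residual reactance is a small fraction of an ohm, so the reflection coefficient is close to zero at 1.9 GHz, in line with the conclusion of step 6.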


3.5     Problems 


3.1  The  following  is  a  differential  wideband  LNA. 

[Figure: schematic of the differential wideband LNA, with supply $V_{DD}$, bias input, and feedback elements $R_f$, $C_f$]

(a)  Identify the type of feedback.

(b)  Develop the single-ended portion only of this circuit. For this single-ended circuit, under the following conditions:

1.  without feedback elements $R_f$, $C_f$

2.  with only feedback element $R_f$ (assume $C_f$ is a short circuit)

(i)  Calculate the equivalent input referred current and voltage noise in terms of device parameters: $g_m$, $C_{gs}$, $C_{gd}$, $R_o$, etc., and circuit parameters: $R_f$, $C_f$, etc. You can neglect the frequency effect and assume only thermal noise.
(ii)  Calculate the -3 dB bandwidth of this circuit, using the zero-valued time-constant method, in terms of device parameters: $g_m$, $C_{gs}$, $C_{gd}$, $R_o$, etc., and circuit parameters: $R_f$, $C_f$, etc. Simplify the expression.

(c)  The $C_f$-$R_f$ feedback arrangement in this LNA is particularly useful when $V_{in}$ has source inductances (i.e., between $V_{in+}$ and the gate of $M_1$ there is an inductor; similarly between $V_{in-}$ and the gate of $M_3$ there is another inductor). Explain why, with source inductances, having this $C_f$-$R_f$ feedback arrangement is better than having no feedback (but still having a resistor $R_f$ connected from the gate of $M_1$ to ground and a resistor $R_f$ connected from the gate of $M_3$ to ground).


3.2  For the wideband LNA shown in Figure 3.3a, rederive the NF, including noise contribution from $M_2$, $M_3$, in terms of circuit parameters: $R_s$, $R_{op}$, etc., and device parameters: $g_m$, $C_{gs}$, etc., when appropriate. Include the frequency-dependent effect.

Fig. P3.1 (two wideband amplifiers with different feedback schemes)

3.3  Shown in Figure P3.1 are two wideband amplifiers using different feedback schemes.

(a)  Calculate the equivalent input referred voltage and current noise sources at the input with feedback, $\overline{v^{*2}}$, $\overline{i^{*2}}$, in terms of the equivalent input referred voltage and current noise sources without feedback, $\overline{v^2}$, $\overline{i^2}$.

(b)  In the above two amplifiers, because $M_1$'s I-V characteristic has only a square law dependence, there is no third order distortion. Let us now look at the differential implementations of the above two amplifiers. Due to the differential structure, there is now third order distortion. Assuming these structures to be memoryless, calculate the IM3 (using Taylor series) of these two differential implementations of LNA in terms of circuit parameters: $R_F$, $R_E$, etc., device parameters: $g_m$, etc., input amplitude $A$, and the overdrive voltage, $V_{GS} - V_T$. (Hint: (4.32) of Chapter 4 is a good start.)

3.4  This  question  is  related  to  some  qualitative  understanding  of  the  narrowband 
LNA  described  in  the  chapter. 

a.  Implement the narrowband LNA in Figure 3.5a using another active device to replace $M_1$. This active device's small-signal representation is such that in Figure 3.5b, instead of having a voltage controlled current source ($g_mV_{gs}$) we have a voltage controlled voltage source ($aV_{gs}$). Can $Z_{in}$ still be made resistive?

b.  What  is  the  function  of  Cy  in  Figure  3.8b  other  than  providing  gain? 

3.5  Rederive  (3.106)  using  feedback  theory. 

3.6  For the narrowband LNA shown in Figure 3.8b, rederive the equivalent input referred voltage noise, this time including noise from $L_3$, $M_2$ (you should also consider the frequency effect). The answer should be expressed in terms of circuit parameters: $R_s$, etc., device parameters: $g_m$, $C_{gs}$, etc., as well as $Q$ (quality factor) of the input circuit.


3.7  In the following chart, compare the different LNA architectures according to their gain, NF, and matching performances.

              Wideband              Wideband       Narrowband
              1-stage CS            CS-cascode     CS; inductor
              (common source)                      degenerated
Gain
NF
Matching

3.8  Shown in Figure P3.2 is a two-stage wideband LNA using feedback. Assume $R_s$ to be very small, $R_1 \ll r_{oM_1}$, $R_2 \ll r_{oM_2}$, where $r_{oM_1}$, $r_{oM_2}$ are the device output resistances of $M_1$, $M_2$. Further assume that $R_1$ is approximately the same as $R_2$, and that $(W/L)_1$ and $(W/L)_2$ are approximately the same.


Fig. P3.2

a.  Identify the type of feedback.

In terms of device parameters: $g_m$, $r_o$, $C_{gs}$, $C_{gd}$, $C_{db}$, etc., and circuit parameters: $R_1$, $R_2$, etc., calculate the following [for b), c) ignore frequency effects]:

b.  Forward gain $a$, and feedback factor $f$

c.  $R_{in}$

d.  $f_{-3\ \text{dB}}$, the -3 dB frequency

Chapter 4
Active Mixer

4.1     Introduction 

A mixer, or frequency converter, converts a signal from one frequency (typically
$\omega_{rf}$) to another frequency (typically $\omega_{if}$) with a certain gain. This gain is called the
conversion gain ($G_c$) and is defined to be the output signal amplitude at $\omega_{if}$ divided
by the input signal amplitude at $\omega_{rf}$. The power gain of a mixer, $G$, which has
already been defined in Chapter 2, is related to this $G_c$. In general, $G$ is simply $G_c^2$,
unless the mixer's switching effect is considered. Ideally, in a mixer, $G_c$ should be
large whereas distortion and noise should be low.

If we ignore distortion and noise effects, one useful first-order model of a mixer is
that of a variable gain-controlled amplifier in which the local oscillator signal $V_{lo}$
controls the gain while the RF signal $V_{rf}$ is amplified. $V_{lo}$ is a periodic signal oscillating
at $\omega_{lo}$. A two-port representation of such an amplifier using a voltage-controlled
voltage source is shown inside the rectangular box in Figure 4.1. The voltage-controlling
function can be expressed as the product of $G_0$, a scaling factor, and the
term $S$, where $S$ is a nonlinear function of $V_{lo}$. Hence $S$ is written as $S[V_{lo}]$. Typically $S$
has the shape drawn as a dotted line in Figure 4.2. To simplify matters we usually
perform piecewise linearization so that $S[V_{lo}]$ is approximated by the solid line in
Figure 4.2. Using this approximation, when $A_{lo}$, the amplitude of $V_{lo}$, is large (much
larger than $V_+$ and $V_-$), $S$ is modeled by using the sgn function (called the sign
function). Specifically, $S[V_{lo}] = (\mathrm{sgn}[V_{lo}] + 1)/2$. When $A_{lo}$ is small (within $V_+$ and
$V_-$), $S[V_{lo}]$ follows $V_{lo}$ linearly. Mathematically, $S[V_{lo}] = (V_{lo} - V_-)/(V_+ - V_-)$.

To show mathematically how mixing occurs, let us start from the
expression for $V_{if}$ as given in Figure 4.1:

$$V_{if}(t) = V_{rf}(t)\,G_0\,S[V_{lo}(t)] \qquad (4.1)$$

Fig. 4.1  Mixer modeled as variable gain amplifier. The controlled source delivers
$V_{if}(t) = V_{ctrl}\,G_0\,S[V_{lo}(t)] = V_{rf}(t)\,G_0\,S[V_{lo}(t)]$, where $S[V_{lo}(t)] = 1$ for $V_{lo} > V_+$,
$S[V_{lo}(t)] = 0$ for $V_{lo} < V_-$, and $S[V_{lo}(t)] = (V_{lo} - V_-)/(V_+ - V_-)$ otherwise.

Fig. 4.2  Plot of the $S$-function versus the control voltage $V_{lo}$.


First let us assume $A_{lo}$ is large (much larger than $V_+$ and $V_-$) and, as stated in
Figure 4.1, $S$ is modeled by the sgn function of $V_{lo}$. Further, we express $V_{lo}$ as
$V_{lo}(t) = A_{lo}\cos\omega_{lo}t$; then $S[V_{lo}]$ is a periodic function oscillating at $\omega_{lo}$. Consequently,
$S[V_{lo}]$ can be expanded in a Fourier series [1] with fundamental frequency $\omega_{lo}$.

Substituting this expansion in (4.1), $V_{if}$ becomes

$$V_{if}(t) = V_{rf}(t)\left[\frac{G_0}{2} + \frac{2G_0}{\pi}\cos\omega_{lo}t - \frac{2G_0}{3\pi}\cos 3\omega_{lo}t + \cdots\right] \qquad (4.2)$$


Let us examine the second term in (4.2). If $V_{rf}$ is expressed as $V_{rf}(t) = A_{rf}\cos\omega_{rf}t$,
this second term becomes

$$\frac{2G_0}{\pi}\cos\omega_{lo}t \cdot A_{rf}\cos\omega_{rf}t$$

Doing trigonometric expansion, this term further becomes

$$\frac{G_0 A_{rf}}{\pi}\left[\cos(\omega_{rf}-\omega_{lo})t + \cos(\omega_{rf}+\omega_{lo})t\right] \qquad (4.3)$$

It is now obvious that frequency translation and hence mixing occur. To obtain
the mixed-down frequency component, we are interested just in the first term in (4.3).
We can isolate it by following the mixer with a bandpass filter centered at
$\omega_{if} = \omega_{rf} - \omega_{lo}$. Then we obtain the mixed-down component at $\omega_{if}$, which, for the
sake of reusing old symbols, we simply denote as $V_{if}$ again. (From now on $V_{if}$ will be
used interchangeably for the two cases: before filtering and after filtering. The proper
interpretation should become clear from the context.) Hence $V_{if}$ is given by

$$V_{if} = \frac{G_0 A_{rf}}{\pi}\cos(\omega_{rf}-\omega_{lo})t = \frac{G_0 A_{rf}}{\pi}\cos\omega_{if}t \qquad (4.4)$$

Applying the definition of $G_c$ to (4.4), we get

$$G_c = \frac{G_0}{\pi} \qquad (4.5)$$

Notice from (4.4) that $V_{if}$ is independent of $A_{lo}$. This is equivalent to saying that
the $S$ function in (4.1) behaves as a switching function.

Secondly, let us assume $A_{lo}$ is small (within $V_+$ and $V_-$). Under this condition, as
stated in Figure 4.1, $S(V_{lo}(t)) = \frac{V_{lo} - V_-}{V_+ - V_-}$. To simplify our discussion, let us for the
present case set $V_+ = 1$, $V_- = 0$. Then we have $S(V_{lo}(t)) = V_{lo}(t) = A_{lo}\cos\omega_{lo}t$.
Substituting in (4.1), we have

$$V_{if}(t) = G_0 V_{rf}(t) A_{lo}\cos\omega_{lo}t$$

Doing trigonometric expansion, $V_{if}$ becomes

$$\frac{G_0}{2}A_{lo}A_{rf}\left(\cos(\omega_{rf}-\omega_{lo})t + \cos(\omega_{rf}+\omega_{lo})t\right) \qquad (4.6)$$

If we examine the first term, again it is apparent that frequency translation and
hence mixing occur. $G_c$ becomes $(G_0/2)A_{lo}$. Notice from (4.6), however, that $V_{if}$ is
now dependent on $A_{lo}$. This is equivalent to saying that the $S$ function in (4.1)
behaves like a multiplying function. Therefore the $S$ function's behaviour is
different from the case when $A_{lo}$ is large.
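
The two regimes can be checked numerically. The following is a minimal sketch, not a circuit simulation: it evaluates (4.1) on a sampled time axis and extracts the amplitude of the component at $\omega_{if}$ by correlation. The function names, the test frequencies (10 Hz and 8 Hz), and the sample count are assumptions chosen only for illustration; `s_multiplying` uses the simplification $V_+ = 1$, $V_- = 0$ adopted in the text.

```python
import math

def conversion_gain(s_func, a_lo, g0=1.0, a_rf=1.0,
                    f_rf=10.0, f_lo=8.0, n=20000):
    """Estimate Gc = (IF amplitude) / (RF amplitude) for the model of (4.1)."""
    f_if = f_rf - f_lo
    acc = 0.0
    for i in range(n):
        t = (i + 0.5) / n                      # one second, mid-point sampling
        v_rf = a_rf * math.cos(2.0 * math.pi * f_rf * t)
        v_lo = a_lo * math.cos(2.0 * math.pi * f_lo * t)
        v_if = v_rf * g0 * s_func(v_lo)        # equation (4.1)
        acc += v_if * math.cos(2.0 * math.pi * f_if * t)
    # Correlation with cos(w_if t) gives half the amplitude, hence the factor 2.
    return (2.0 * acc / n) / a_rf

def s_switching(v_lo):
    """Large-A_lo model: S = (sgn[V_lo] + 1) / 2."""
    return (math.copysign(1.0, v_lo) + 1.0) / 2.0

def s_multiplying(v_lo):
    """Small-A_lo model with V+ = 1, V- = 0: S follows V_lo linearly."""
    return v_lo

for a_lo in (1.0, 5.0):
    print("switching  ", a_lo, round(conversion_gain(s_switching, a_lo), 4))
for a_lo in (0.1, 0.2):
    print("multiplying", a_lo, round(conversion_gain(s_multiplying, a_lo), 4))
```

With the switching model the estimate stays near $G_0/\pi$ regardless of $A_{lo}$, as (4.5) predicts, whereas with the multiplying model it scales as $(G_0/2)A_{lo}$.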

The disadvantage of having $V_{if}$ dependent on $A_{lo}$ stems from the fact that $V_{lo}$ is
usually generated by a frequency synthesizer (to be described in Chapter 7)
and its exact amplitude is hard to control, leading to a $G_c$ that is hard to control. As a
result, the rest of the book is focused mostly on mixers that operate with a large $A_{lo}$
(much larger than $V_+$ and $V_-$). Because the accompanying $S$-function of these mixers
behaves as a switching function, they are called switching mixers.
Switching mixers can be further classified according to whether they
perform the switching in the voltage or current domain. In this chapter we concentrate
on switching mixers based on current. Because the mixers we consider have $G_c$
larger than one, we call them active mixers.

To summarize, the basis of mixing lies in having a variable gain amplifier (shown
in Figure 4.1) whose gain can be controlled by an external periodic signal. Analyzing
a mixer using this representation involves specifying the amplifier gain or transfer
term with respect to the input signal and then expanding the gain in a Fourier series.
This viewpoint highlights the periodic time-varying nature of a mixer. Taking this
viewpoint one step further, it is precisely this periodic time-varying nature (rather
than the nonlinear nature, as explained in some books) of a mixer that leads to mixing.

Finally, let us explore the types of electronic devices that can implement this
variable gain amplifier. One of the most common choices is the
transconductance of a MOS/BJT transistor biased in the saturation/active region.
This transconductance varies as a function of gate/base bias. The transconductance
of a source-coupled pair (SCP)/emitter-coupled pair (ECP) is another choice. A
further choice is the on resistance ($R_{on}$) of a MOS transistor biased in the triode region.


4.2     Balancing 

In  this  section  we  will  use  the  concept  of  balancing  to  classify  active  mixers. 

4.2.1     Unbalanced  Mixer 

The simplest active mixer is an unbalanced mixer, as shown in Figure 4.3, where
the IF signal ($I_{if}$) is taken from one branch only. This $I_{if}$ signal flows into a resistor
$R_L$ (not shown) and develops $V_{if}$. $V_{if}$ can be obtained by rewriting (4.2):

$$V_{if}(t) = A_{rf}\cos\omega_{rf}t \times G_0\left(\frac{1}{2} + \sum_{n=1}^{\infty}\frac{\sin(n\pi/2)}{n\pi/2}\cos n\omega_{lo}t\right) \qquad (4.7)$$

From (4.7) it is apparent that the DC component of the switching function (the 1/2 term
inside the bracket) multiplies $V_{rf}$, producing a scaled copy of $V_{rf}$ at $V_{if}$. Hence frequency
components at the RF frequency appear at the output. This phenomenon is called
RF feedthrough and is undesirable.


4.2.2     Single  Balanced  Mixer 

An improvement over the unbalanced mixer is the single balanced mixer, as
shown in Figure 4.4. Here the IF signal is taken from both branches ($I_{if}^+$ and $I_{if}^-$).
Each of $I_{if}^+$ and $I_{if}^-$ flows into a separate resistor, $R_L^+$ and $R_L^-$ (not shown), and
develops $V_{if}^+$ and $V_{if}^-$. $V_{if}$ is taken as $V_{if}^+ - V_{if}^-$. $V_{if}$ can now be written as

$$V_{if}(t) = A_{rf}\cos\omega_{rf}t \cdot 2G_0\sum_{n=1}^{\infty}\frac{\sin(n\pi/2)}{n\pi/2}\cos n\omega_{lo}t \qquad (4.8)$$

Fig. 4.3  Unbalanced mixer

Fig. 4.4  Single balanced mixer


Because signals are taken from both branches, the RF feedthrough contributions from the two
branches cancel one another. In addition to the absence of RF feedthrough, it can
be shown that in the single balanced case even harmonics of the RF frequency are
missing in the output. In addition, frequency components at $n\omega_{lo}$ from the RF input,
where $n$ is an integer, will not propagate through. On the other hand, frequency
components at $n\omega_{lo}$ from the LO input, where $n$ is an integer, will propagate
through. This is called LO feedthrough.

Fig. 4.5  Alternate single balanced mixer, where LO and RF ports are swapped

An alternative single balanced mixer consists of applying $V_{rf}$ to the SCP, $M_1$-$M_2$,
and $V_{lo}$ to the V-I converter, $M_3$, as shown in Figure 4.5. We can obtain an equation
for $V_{if}$ just by swapping the $V_{rf}$ and $V_{lo}$ terms in (4.8). Some form of duality exists
between Figure 4.4 and Figure 4.5. For example, the single balanced mixer in
Figure 4.5 rejects LO feedthrough but allows RF feedthrough instead.


4.2.3    Double  Balanced  Mixer:  Gilbert  Mixer 


Finally,  we  can  combine  two  single  balanced  mixers  and  make  a  double  balanced 
mixer,  also  called  the  Gilbert  mixer  or  the  quad  mixer.  A  double  balanced  mixer 
rejects  both  RF  and  LO  feedthroughs.  The  double  balanced  mixer  is  the  most 
commonly  used  mixer,  so  we  will  focus  on  it  for  the  rest  of  the  chapter. 
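
The feedthrough behaviour of the three topologies can be illustrated with an idealized numerical sketch. The model is an assumption made for illustration rather than a circuit from the text: the V-I converter is taken to deliver a tail current `i_bias + g_m * v_rf`, the LO is an ideal square wave, and the output is the (differential) current. The helper names and test frequencies are hypothetical.

```python
import math

def tone_amplitude(samples, freq):
    """Amplitude of the component at `freq` Hz in one second of samples."""
    n = len(samples)
    c = s = 0.0
    for i, x in enumerate(samples):
        phase = 2.0 * math.pi * freq * (i + 0.5) / n
        c += x * math.cos(phase)
        s += x * math.sin(phase)
    return 2.0 * math.hypot(c, s) / n

def mixer_outputs(i_bias=1.0, g_m=1.0, a_rf=0.1, f_rf=10.0, f_lo=8.0, n=20000):
    """Idealized output currents of the three topologies (assumed model)."""
    unbalanced, single, double = [], [], []
    for i in range(n):
        t = (i + 0.5) / n
        v_rf = a_rf * math.cos(2.0 * math.pi * f_rf * t)
        sq = math.copysign(1.0, math.cos(2.0 * math.pi * f_lo * t))  # +/-1 LO
        s = (sq + 1.0) / 2.0                    # 0/1 switching function
        i_tail = i_bias + g_m * v_rf
        unbalanced.append(i_tail * s)                       # one branch only
        single.append(i_tail * s - i_tail * (1.0 - s))      # difference of two branches
        i_p = i_bias + 0.5 * g_m * v_rf         # two anti-phase tail currents
        i_m = i_bias - 0.5 * g_m * v_rf
        double.append((i_p - i_m) * sq)                     # cross-coupled quad
    return {"unbalanced": unbalanced,
            "single balanced": single,
            "double balanced": double}

F_RF, F_LO = 10.0, 8.0
feedthrough = {}
for name, wave in mixer_outputs(f_rf=F_RF, f_lo=F_LO).items():
    feedthrough[name] = (tone_amplitude(wave, F_RF), tone_amplitude(wave, F_LO))
    print(name, "RF:", round(feedthrough[name][0], 4),
          "LO:", round(feedthrough[name][1], 4))
```

Under these assumptions the unbalanced output contains components at both the RF and LO frequencies, the single balanced output loses the RF component but keeps the LO component (which here comes from the bias current being commutated), and the double balanced output contains neither.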

The absence of feedthrough from RF to IF ports and LO to IF ports in a double
balanced mixer occurs only under ideal conditions. In practice, a finite amount of
feedthrough remains, mainly as a result of mismatches. Finally, apart
from feedthrough from the RF and LO ports to the IF port, feedthrough between the
RF and LO ports is also undesirable as it introduces reradiation. This type of
feedthrough is present in a double balanced mixer.

Qualitatively, a Gilbert mixer consists of a V-I converter, a current switching block,
and an I-V converter. Figure 4.6 shows an implementation of the double balanced
mixer using three SCPs. Two SCPs ($M_3$-$M_4$ and $M_5$-$M_6$) are used to do the switching
and one SCP ($M_1$-$M_2$) is used for V-I conversion. The two switches operate in
opposite polarity since the $V_{lo}$ signals are applied in opposite polarity. The V-I
converter generates two current inputs to the tail nodes of the switches, which are
also 180° out of phase. Notice, as discussed in Chapter 2, that if used in a heterodyne
architecture this mixer is preceded by a BPF for anti-imaging. Hence a matching network
is needed, which we have not shown.

To  further  describe  this  circuit,  let  us  follow  a  few  labeling  conventions: 

1.  Input  signals  have  "rf"  subscripts,  control  signals  have  "lo"  subscripts,  and 
output  signals  have  "if"  subscripts. 

2.  For the SCP ($M_1$-$M_2$) and the output currents, along the main line of symmetry
(passing vertically through the tail node of $M_1$-$M_2$) of the differential circuits, the
branch current on the left-hand side has a "+" superscript and conversely the
branch current on the right-hand side has a "-" superscript.

3.  For the second and third SCPs ($M_3$-$M_4$ and $M_5$-$M_6$), there are two levels of
subscript and superscript distinctions:

3.1  Along the main line of symmetry (passing vertically between $M_4$ and $M_5$) of
the differential circuits, the branch current on the left-hand side has a subscript
labeled "l" and conversely the branch current on the right-hand side has a
subscript labeled "r".

3.2  Along the secondary line of symmetry (passing vertically through the tail
node of $M_3$-$M_4$ and $M_5$-$M_6$) of the differential circuits, the branch current
on the left-hand side has a "+" superscript and conversely the branch current
on the right-hand side has a "-" superscript.

Fig. 4.6  Gilbert/quad mixer

Fig. 4.7  A square wave $V_{lo}$ for the Gilbert mixer. The two phases are shown

Next let us explain the operation of the circuit. $M_1$, $M_2$ form an SCP that does a V-I
conversion for the $V_{rf}$ input signal. The currents $I_{rf}^+$, $I_{rf}^-$ generated from $M_1$, $M_2$ are then
switched through the other two SCPs formed by transistors $M_3$-$M_6$. Let us assume
$V_{lo}$ is the square wave shown in Figure 4.7. As shown, it is large enough to
switch transistors $M_3$-$M_6$ totally on when it is high and totally off when it is low. We
further assume that $I_{rf}^+$ and $I_{rf}^-$ remain constant during the LO period.

If we apply the $V_{lo}$ shown in Figure 4.7 to the circuit in Figure 4.6, then during
time $\phi_1$, $V_{lo}$ is positive and larger than $V_+$. This means $M_4$, $M_5$ are completely on,
with drain currents equal to their tail currents, which in turn equal $I_{rf}^+$ and $I_{rf}^-$, respectively.
$M_3$, $M_6$ are completely off and hence their drain currents are zero. The
positive output current of the mixer, $I_{if}^+$, through the cross-coupled connection, is
the sum of the drain currents from $M_3$ and $M_5$, and hence $I_{if}^+ = I_{d3} + I_{d5} = 0 + I_{rf}^- = I_{rf}^-$
(i.e., it equals the current from the tail node of $M_5$). The negative output current of
the mixer, $I_{if}^-$, again through the cross-coupled connection, is the sum of the drain
currents from $M_4$ and $M_6$, and hence $I_{if}^- = I_{d4} + I_{d6} = I_{rf}^+ + 0 = I_{rf}^+$ (i.e., it equals the
current from the tail node of $M_4$).

Now $V_{lo}$ switches during time $\phi_2$ and exactly the opposite occurs. Since $V_{lo}$ is
negative and is less than $V_-$, $M_4$, $M_5$ are now completely turned off. On
the other hand, $M_3$, $M_6$ are now completely turned on. Since the pair of "on"
transistors has switched, the currents routed to the outputs have also been
swapped. Repeating the procedure, the positive output current of the mixer
$I_{if}^+$ is still the sum of the drain currents from $M_3$ and $M_5$. However, since the current is now
from the tail node of $M_3$ as opposed to being from the tail node of $M_5$,
$I_{if}^+ = I_{d3} + I_{d5} = I_{rf}^+ + 0 = I_{rf}^+$ (i.e., it equals the current from the tail node of $M_3$).
Similarly, the negative output current of the mixer $I_{if}^-$ is still the sum of the drain currents
from $M_4$ and $M_6$, but again the current is now from the tail node of $M_6$ as opposed to
being from the tail node of $M_4$, and hence $I_{if}^- = I_{d4} + I_{d6} = 0 + I_{rf}^- = I_{rf}^-$.

Fig. 4.8  Gilbert mixer output current as a function of time

In summary, the output currents have swapped polarity in synchronization with the
controlling $V_{lo}$, and mixing has occurred. This is shown in Figure 4.8, where we have
assumed that $I_{rf}^+$ and $I_{rf}^-$ remain constant during the LO period.

Assuming that the $I_{rf}^+$ and $I_{rf}^-$ amplitudes are equal, the $I_{if}^+$, $I_{if}^-$ waveforms
shown in Figure 4.8 will be completely symmetrical.
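
The current routing in the two LO phases can be written out as a small bookkeeping sketch. It assumes ideal switches (each quad transistor fully on or fully off), and the function name and the example current values are hypothetical.

```python
def gilbert_output_currents(lo_high, i_rf_plus, i_rf_minus):
    """Return (I_if+, I_if-) for ideal switching of the quad M3-M6.

    M3, M4 share the tail current I_rf+; M5, M6 share the tail current I_rf-.
    """
    if lo_high:
        # Phase 1: M4 and M5 fully on, M3 and M6 fully off.
        i_d3, i_d4, i_d5, i_d6 = 0.0, i_rf_plus, i_rf_minus, 0.0
    else:
        # Phase 2: M3 and M6 fully on, M4 and M5 fully off.
        i_d3, i_d4, i_d5, i_d6 = i_rf_plus, 0.0, 0.0, i_rf_minus
    # Cross-coupled connection: I_if+ = I_d3 + I_d5, I_if- = I_d4 + I_d6.
    return i_d3 + i_d5, i_d4 + i_d6

for lo_high in (True, False, True, False):
    i_if_plus, i_if_minus = gilbert_output_currents(lo_high, 1.2, 0.8)
    print(lo_high, i_if_plus, i_if_minus, i_if_plus - i_if_minus)
```

The differential output $I_{if}^+ - I_{if}^-$ keeps its magnitude but changes sign every time the LO changes phase, which is the commutation described above.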

We already mentioned that, when compared to the single-ended mixer, this
Gilbert mixer has the advantage that feedthrough from $V_{lo}$, $V_{rf}$ will not get
propagated to the output.

In  addition,  this  mixer's  Gc  is  doubled  compared  with  the  single  balanced  case 
and  is  four  times  that  of  the  unbalanced  case. 

We  next  go  through  the  design  issues  of  the  active  mixer.  From  Chapter  2  we 
have  identified  the  following  key  parameters:  power  gain  G,  distortion  (IIP3)  and 
noise  (NF).  We  will  be  developing  formulas  of  these  key  parameters  for  the  active 
mixer  (except  that  we  will  determine  Gc  instead  of  G,  as  Gc  is  more  general  and 
considers  the  mixer's  switching  effect).  The  idea  is  to  translate  these  key 
parameters  into  circuit  component  values  such  as  W/L  ratio  and  bias  current/ 
voltages.  Because  a  mixer  is  a  periodic  time-varying  circuit,  these  parameters  are 
difficult to derive mathematically in one step. Hence we will first derive them by
assuming that $V_{lo}$ is constant (not switching). We then try to derive them when $V_{lo}$ is
switching. Furthermore, in the $V_{lo}$ switching case, $V_{lo}$ is assumed to have zero rise/fall
time. Relaxing to a nonzero rise/fall time makes a closed-form solution
practically impossible to get, and we have to resort to simulations.

Refer to Figure 4.3 for the unbalanced mixer. Let us assume that $V_{lo}$ is not
switching, and we have $G_c = g_{m3} \times R_L$, where $R_L$ (not shown) is the resistor hanging
from the drain of $M_1$ to $V_{DD}$. For the $V_{lo}$ switching case let us assume that the pole/zero
frequencies of the mixer are much higher than $f_{lo}$. We further assume that $V_{lo}$ is a
square wave with a 50% duty cycle. $G_c$ can be obtained from the coefficient of the
Fourier series expansion. This turns out to be $(1/\pi)(g_{m3} \times R_L)$. The conversion gain
for the double balanced (Gilbert) mixer is four times that. Hence, referring to
Figure 4.6 we have

$$G_c = \frac{4}{\pi}(g_{m1} \times R_L) \qquad (4.9)$$

For this Gilbert mixer, there will be two load resistors $R_L$ connected between the
drains of $M_3$-$M_5$, $M_4$-$M_6$ and $V_{DD}$.

Numerical example 4.1.

Referring to Figure 4.6, let us assume that $M_1$-$M_6$ all have the same $W/L$ and that
we make $V_{GS1} - V_t = 0.387$ V. Assume $k' = 15\ \mu A/V^2$. Then, with $k = k'(W/L) = 6250\ \mu A/V^2$,
$g_{m1} = k(V_{GS1} - V_t) = 2.4\ m\Omega^{-1}$. Suppose we want to design for a conversion gain of
10 dB. Substituting this in (4.9) we get

$$G_c = \frac{4}{\pi}g_{m1}R_L = \frac{4}{\pi} \times 2.4\ m\Omega^{-1} \times R_L = 3.16 \qquad (4.10)$$

Solving, $R_L = 1034\ \Omega$.
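
The arithmetic of this example can be reproduced with a short sketch of (4.9). The function name is hypothetical; the inputs are the values quoted in the example. Because the script does not round $g_{m1}$ to 2.4 mΩ⁻¹ before solving, its result for $R_L$ differs slightly from the hand calculation.

```python
import math

def gilbert_load_resistor(k, v_ov, gain_db):
    """Solve (4.9), Gc = (4/pi) * g_m1 * R_L, for R_L.

    k       : transconductance parameter k = k' * (W/L), in A/V^2
    v_ov    : overdrive voltage V_GS1 - V_t, in V
    gain_db : desired conversion gain in dB (voltage ratio)
    """
    g_m1 = k * v_ov                          # g_m1 = k (V_GS1 - V_t)
    g_c = 10.0 ** (gain_db / 20.0)           # dB to amplitude ratio
    r_l = g_c * math.pi / (4.0 * g_m1)
    return g_m1, g_c, r_l

g_m1, g_c, r_l = gilbert_load_resistor(k=6250e-6, v_ov=0.387, gain_db=10.0)
print("g_m1 = %.3f mA/V, Gc = %.2f, R_L = %.0f ohm" % (g_m1 * 1e3, g_c, r_l))
```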

If we no longer ignore distortion and noise effects in a mixer, the first-order model
shown in Figure 4.1 is no longer adequate. The distortion and noise effects can be
included by specifying the IIP3 and NF of a mixer, as was done in Chapter 2. In this
and the next section we are interested in taking the IIP3 specified for a mixer in
Chapter 2 and finding circuit parameters such as $W/L$ and $I_{bias}$ that will satisfy this
specification. In section 4.7 we will repeat this process for the NF.

To relate IIP3 to circuit parameters we need to find the proper design equations.
For the bulk of sections 4.5 and 4.6 we assume that $V_{lo}$ is not switching and so the mixer
is represented by a nonlinear time invariant (NLTI) system. In this section we look
at the distortion behaviour of this NLTI system at low frequency, while the
corresponding high frequency distortion behaviour is reserved for the next section.
For any NLTI system, such as a mixer, operating at low frequency means we
assume that any capacitors or inductors can be neglected. Returning to the Gilbert
mixer in Figure 4.6, since we neglect capacitive effects, the individual transistors in
the quad $M_3$-$M_6$ are either completely turned on or off. Accordingly, the output
current $I_{if}$ equals either $I_{rf}$ or $-I_{rf}$. Hence the switching SCPs ($M_3$-$M_6$) in Figure 4.6
do not contribute distortion.

Therefore distortion comes primarily from the bottom SCP, which does the V-I
conversion. Furthermore, we assume that this distortion is dominated by the
nonlinear square law I-V characteristics of the MOS transistors biased in saturation.
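
Referring to the SCP $M_1$-$M_2$ in Figure 4.6 we can write, for transistor $M_1$,

$$I_{rf}^+ = \frac{k}{2}(V_{gs1} - V_t)^2 \qquad (4.11)$$

Next the loop equation gives us

$$V_{rf}^+ - V_{gs1} = V_{rf}^- - V_{gs2} \qquad (4.12)$$

Rearranging (4.12) gives us

$$V_{gs1} = V_{rf}^+ - V_{rf}^- + V_{gs2} = V_{rf} + V_{gs2} \qquad (4.13)$$

Substituting (4.13) into (4.11),

$$I_{rf}^+ = \frac{k}{2}\left(V_{rf} + (V_{gs2} - V_t)\right)^2 \qquad (4.14)$$

Repeating the procedure for transistor $M_2$, whose drain current is $I_{rf}^- = I_{SS} - I_{rf}^+$ (with $I_{SS}$ the tail current of the SCP),

$$I_{SS} - I_{rf}^+ = \frac{k}{2}(V_{gs2} - V_t)^2$$

which means

$$V_{gs2} - V_t = \sqrt{\frac{2(I_{SS} - I_{rf}^+)}{k}} \qquad (4.15)$$

Substituting (4.15) into (4.14),

$$I_{rf}^+ = \frac{k}{2}\left(V_{rf} + \sqrt{\frac{2(I_{SS} - I_{rf}^+)}{k}}\right)^2 \qquad (4.16)$$

Let us normalize by defining the normalized $I_{rf}^+$, $I_{SS}$ as

$$I_{rfn}^+ = \frac{2I_{rf}^+}{k}, \qquad I_{SSn} = \frac{2I_{SS}}{k}$$

Substituting the normalized variables into (4.16), we have

$$I_{rfn}^+ = \left(V_{rf} + \sqrt{I_{SSn} - I_{rfn}^+}\right)^2 \qquad (4.17)$$

After some algebraic manipulation,

$$\sqrt{I_{rfn}^+} - V_{rf} = \sqrt{I_{SSn} - I_{rfn}^+}$$

or

$$V_{rf} = \sqrt{I_{rfn}^+} - \sqrt{I_{SSn} - I_{rfn}^+} = \sqrt{\frac{I_{SSn}}{2} + i_{rfn}^+} - \sqrt{\frac{I_{SSn}}{2} - i_{rfn}^+} \qquad (4.18)$$

Here, $i_{rfn}^+$ is the ac part of $I_{rfn}^+$, that is, $I_{rfn}^+ = I_{SSn}/2 + i_{rfn}^+$. Notice that (4.18) represents an odd function
around $I_{rfn}^+ = I_{SSn}/2$.

Factoring out the $\sqrt{I_{SSn}/2}$ term we have:

$$V_{rf} = \sqrt{\frac{I_{SSn}}{2}}\left[\sqrt{1 + \frac{2i_{rfn}^+}{I_{SSn}}} - \sqrt{1 - \frac{2i_{rfn}^+}{I_{SSn}}}\right] \qquad (4.19)$$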

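(4.19) gives $V_{rf}$ in terms of $i_{rfn}^+$. Since $V_{rf}$ is the input and $i_{rfn}^+$ is the output, we would instead
want to express $i_{rfn}^+$ in terms of $V_{rf}$. Since there is no capacitive effect, $i_{rfn}^+$
can be expanded as a power series (also denoted as Taylor series; the two
terms will be used interchangeably) in powers of $V_{rf}$:

$$i_{rfn}^+ = a_1 V_{rf} + a_2 V_{rf}^2 + a_3 V_{rf}^3 + \cdots \qquad (4.20)$$

Here $a_1, a_2, \ldots$ are coefficients.

Unfortunately, since it is not easy to write $i_{rfn}^+$ explicitly in terms of $V_{rf}$ [as
evidenced in (4.19)], we get around this difficulty by doing two expansions.

First we expand the two square root terms inside the bracket in (4.19) in powers of
$2i_{rfn}^+/I_{SSn}$:

$$V_{rf} = \sqrt{\frac{I_{SSn}}{2}}\left[\left(1 + \frac{1}{2}\frac{2i_{rfn}^+}{I_{SSn}} - \frac{1}{8}\left(\frac{2i_{rfn}^+}{I_{SSn}}\right)^2 + \frac{1}{16}\left(\frac{2i_{rfn}^+}{I_{SSn}}\right)^3 - \cdots\right) - \left(1 - \frac{1}{2}\frac{2i_{rfn}^+}{I_{SSn}} - \frac{1}{8}\left(\frac{2i_{rfn}^+}{I_{SSn}}\right)^2 - \frac{1}{16}\left(\frac{2i_{rfn}^+}{I_{SSn}}\right)^3 - \cdots\right)\right] \qquad (4.21)$$

Secondly we expand each of the $i_{rfn}^+$ terms in (4.21) using (4.20).
For simplicity we write down only the first three terms when expanding (4.20).
Upon substituting in (4.21) we have:

$$V_{rf} = \sqrt{\frac{I_{SSn}}{2}}\left[\frac{2}{I_{SSn}}\left(a_1V_{rf} + a_2V_{rf}^2 + a_3V_{rf}^3 + \cdots\right) + \frac{1}{8}\left(\frac{2}{I_{SSn}}\right)^3\left(a_1V_{rf} + a_2V_{rf}^2 + \cdots\right)^3 + \cdots\right] \qquad (4.22)$$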


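Finally we can solve for the coefficients $a_1, a_2, a_3, \ldots$ by equating the coefficients
of $V_{rf}, V_{rf}^2, \ldots$ on both sides of (4.22). For the $V_{rf}$ term:

$$1 = \sqrt{\frac{I_{SSn}}{2}}\left(\frac{2}{I_{SSn}}\right)a_1 \quad\therefore\quad a_1 = \sqrt{\frac{I_{SSn}}{2}} \qquad (4.23)$$

For the $V_{rf}^2$ term:

$$0 = \sqrt{\frac{I_{SSn}}{2}}\left(\frac{2}{I_{SSn}}\right)a_2 \quad\therefore\quad a_2 = 0 \qquad (4.24)$$

For the $V_{rf}^3$ term:

$$0 = \sqrt{\frac{I_{SSn}}{2}}\left[\frac{2}{I_{SSn}}a_3 + \frac{1}{8}\left(\frac{2}{I_{SSn}}\right)^3 a_1^3\right] \quad\therefore\quad a_3 = -\frac{1}{8}\left(\frac{2}{I_{SSn}}\right)^2 a_1^3 \qquad (4.25)$$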


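Let us apply the definition of $HD_3$ as given in (2.28) to the present case:

$$HD_3 = \frac{|\text{3rd-order term}|}{|\text{fundamental}|} = \frac{1}{4}\left|\frac{a_3}{a_1}\right|A_{rf}^2 \qquad (4.26)$$

Substituting (4.25) into (4.26), we have

$$HD_3 = \frac{1}{4}\cdot\frac{1}{8}\left(\frac{2}{I_{SSn}}\right)^2 a_1^2 A_{rf}^2 \qquad (4.27)$$

Substituting (4.23) into (4.27) we have:

$$HD_3 = \frac{1}{32}\left(\frac{2}{I_{SSn}}\right)^2\frac{I_{SSn}}{2}A_{rf}^2 = \frac{1}{16}\frac{1}{I_{SSn}}A_{rf}^2 = \frac{1}{16}\frac{k}{2I_{SS}}A_{rf}^2 = \frac{1}{32}\frac{k}{I_{SS}}A_{rf}^2 \qquad (4.28)$$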


From (2.35), $IM_3 = 3HD_3$, and we can substitute (4.28) into this to obtain $IM_3$.

In summary, the distortion at the output of the mixer will be

$$HD_3 = \frac{1}{32}\frac{k}{I_{SS}}A_{rf}^2 = \frac{1}{32}\frac{\mu C_{ox}\frac{W}{L}}{I_{SS}}A_{rf}^2 \qquad (4.29a)$$

$$IM_3 = \frac{3}{32}\frac{k}{I_{SS}}A_{rf}^2 = \frac{3}{32}\frac{\mu C_{ox}\frac{W}{L}}{I_{SS}}A_{rf}^2 \qquad (4.29b)$$
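
As a sanity check on the series result, (4.18) can be inverted numerically and the third harmonic of the resulting current measured directly. The sketch below assumes an ideal square-law pair with no memory; the bisection solver, the function names, and the operating point ($k$ = 6250 µA/V², overdrive 0.387 V, 50 mV drive) are illustrative choices rather than values prescribed here.

```python
import math

def solve_i_rfn(v_rf, i_ssn):
    """Invert (4.18) for the normalized ac current by bisection.

    V_rf is a monotonically increasing function of i_rfn on
    [-I_SSn/2, +I_SSn/2], so bisection converges.
    """
    lo, hi = -i_ssn / 2.0, i_ssn / 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        v = math.sqrt(i_ssn / 2.0 + mid) - math.sqrt(i_ssn / 2.0 - mid)
        if v < v_rf:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def measured_hd3(a_rf, i_ssn, n=360):
    """Ratio of third-harmonic to fundamental amplitude for V_rf = A_rf cos(theta)."""
    b1 = b3 = 0.0
    for j in range(n):
        theta = 2.0 * math.pi * (j + 0.5) / n
        i = solve_i_rfn(a_rf * math.cos(theta), i_ssn)
        b1 += i * math.cos(theta)
        b3 += i * math.cos(3.0 * theta)
    return abs(b3 / b1)

def predicted_hd3(a_rf, k, i_ss):
    """Equation (4.29a)."""
    return k * a_rf ** 2 / (32.0 * i_ss)

k = 6250e-6                  # A/V^2 (illustrative)
v_ov = 0.387                 # V, overdrive of each transistor at balance
i_ss = k * v_ov ** 2         # each side carries I_SS/2 = (k/2) v_ov^2
i_ssn = 2.0 * i_ss / k       # normalized tail current
a_rf = 0.05                  # V, small enough for the truncated series to hold

hd3_num = measured_hd3(a_rf, i_ssn)
hd3_eq = predicted_hd3(a_rf, k, i_ss)
print("numerical HD3 = %.3e, (4.29a) HD3 = %.3e" % (hd3_num, hd3_eq))
```

For small drive amplitudes the two figures should agree closely; the gap widens as $A_{rf}$ grows because the terms of order higher than three dropped from (4.22) start to matter.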


Notice that for $IM_3$, normally we are interested in the ID3 generated at the desired
signal frequency by the two adjacent channel interferences. These interferences
are denoted as $v_{interference}$ and have amplitudes denoted as $A_{interference}$. To quantify
the $IM_3$ in this case, we rewrite (4.29b) as follows:

$$IM_3 = \frac{3}{32}\frac{k}{I_{SS}}A_{interference}^2 = \frac{3}{32}\frac{\mu C_{ox}\frac{W}{L}}{I_{SS}}A_{interference}^2 \qquad (4.30)$$

From now on, $IM_3$ will be expressed in terms of $A_{interference}$. Referring to (4.29a)
and (4.30), we observe that as $I_{SS}$ goes up, distortion goes down. However, as the $W/L$
ratio goes up, distortion goes up. Therefore to design a low distortion mixer, one
needs to burn more power (undesirable) and keep the $W/L$ ratio (or size) down (there
is a limit). Of course, the amplitude of the input signal, $A_{rf}$, or that of the interference,
$A_{interference}$, should also be kept down, but their values are dictated by the
overall receiver front end design.

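If $A_{rf}$ is small, then the current flowing through $M_1$ can be assumed to be equal to
that through $M_2$, or half of $I_{SS}$. Then we can write (4.29a) as

$$HD_3 = \frac{1}{32}\frac{k}{I_{SS}}A_{rf}^2 = \frac{1}{32}\frac{k}{2\cdot\frac{k}{2}(V_{GS1} - V_t)^2}A_{rf}^2 = \frac{1}{32}\frac{A_{rf}^2}{(V_{GS1} - V_t)^2} \qquad (4.31)$$

If $A_{interference}$ is small, similar considerations apply to (4.30).
Then (4.29a) and (4.30) can be rewritten as

$$HD_3 = \frac{1}{32}\frac{A_{rf}^2}{(V_{GS1} - V_t)^2}, \qquad IM_3 = \frac{3}{32}\frac{A_{interference}^2}{(V_{GS1} - V_t)^2} \qquad (4.32)$$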

We can characterize this distortion using $A_{IP3}$, the amplitude of $v_{rf}$ or $v_{interference}$ at the
third order intercept point, as well. We start from (2.38), $IIP_3|_{dBm} = P_i|_{dBm} - \frac{IM_3|_{dB}}{2}$.
As shown in Problem 4.4, this equation leads to $A_{IP3}^2 = \frac{A_{interference}^2}{IM_3}$.

Substituting $IM_3$ obtained in (4.32) in this equation, we have

$$A_{IP3}^2 = \frac{A_{interference}^2}{IM_3} = \frac{32\,(V_{GS1} - V_t)^2 A_{interference}^2}{3A_{interference}^2}$$

or

$$A_{IP3} = 4\sqrt{\frac{2}{3}}\,(V_{GS1} - V_t) \qquad (4.33)$$


Finally, we look at the case when $V_{lo}$ is switching, but with zero rise and fall time
(i.e., an ideal square wave). Even though we no longer have a time-invariant system,
since $V_{lo}$ is an ideal square wave the situation can be handled by modifying the
preceding results through multiplying the output by the Fourier series representation
of the square wave. The effect of switching is simply to shift the frequency of
the fundamental and the third-order products by $\omega_{lo}$. The amplitudes are both
reduced by the same factor, $1/\pi$, which means that $HD_3$, $IM_3$ remain unchanged.

Numerical  example  4.2. 

Assume that a Gilbert mixer operates under the following conditions:

$V_{GS1} - V_t = 0.387$ V,  $A_{rf} = A_{interference} = 0.316$ V or 0 dBm

Assume that the LO is not switching. Find the mixer's distortion behavior: $HD_3$,
$IM_3$, $IIP_3$.

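We assume that the Gilbert mixer's distortion is dominated by the V-I converter.
Substituting the foregoing values in (4.32) we have

$HD_3 = \frac{1}{32}\left(\frac{0.316}{0.387}\right)^2 = 0.0208$, or $-33.6$ dB

$IM_3 = 3HD_3 = 0.0625$, or $-24.1$ dB

From (2.38),

$$IIP_3|_{dBm} = P_i|_{dBm} - \frac{IM_3|_{dB}}{2} = 0 + \frac{24.1}{2} = 12.0\ \text{dBm}$$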
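
The figures of this example can be cross-checked by evaluating (4.32), (2.38) and (4.33) directly. This is a minimal sketch with a hypothetical function name; it treats $HD_3$ and $IM_3$ as amplitude ratios (hence 20 log) and assumes the 50 Ω, 1 mW reference implied by "0.316 V or 0 dBm". Because the script carries full precision, its printed values may differ in the last digit from figures rounded by hand.

```python
import math

def low_frequency_distortion(v_ov, a_in, p_in_dbm):
    """Evaluate (4.32), (2.38) and (4.33) for a non-switching Gilbert mixer.

    v_ov     : V_GS1 - V_t in volts
    a_in     : amplitude of the RF signal / interferer in volts
    p_in_dbm : the same input level expressed in dBm
    """
    hd3 = a_in ** 2 / (32.0 * v_ov ** 2)        # (4.32)
    im3 = 3.0 * hd3                             # IM3 = 3 HD3, from (2.35)
    hd3_db = 20.0 * math.log10(hd3)
    im3_db = 20.0 * math.log10(im3)
    iip3_dbm = p_in_dbm - im3_db / 2.0          # (2.38)
    a_ip3 = math.sqrt(32.0 / 3.0) * v_ov        # (4.33)
    return hd3_db, im3_db, iip3_dbm, a_ip3

hd3_db, im3_db, iip3_dbm, a_ip3 = low_frequency_distortion(0.387, 0.316, 0.0)
print("HD3 = %.1f dB, IM3 = %.1f dB, IIP3 = %.1f dBm, A_IP3 = %.2f V"
      % (hd3_db, im3_db, iip3_dbm, a_ip3))
```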

Numerical  example  4.3. 

From Table 2.1, the mixer is assigned an IIP3 of -10 dBm. In this example we want
to design this mixer with some safety margin. Let us arbitrarily set the specifications
to be

IIP3 = 5 dBm

Power = 10 mW

Vdd = 3.3 V

k' = 100 µA/V²

Assume that $V_{lo}$ is not switching. Find $(W/L)_1$.

IIP3 = 5 dBm means that $5\ \text{dBm} = 10\log\frac{A_{IP3}^2/(2 \times 50\ \Omega)}{1\ \text{mW}}$. Solving, we have
$A_{IP3} = \sqrt{2 \times 50 \times 10^{(5-30)/10}} = \sqrt{2 \times 50 \times 10^{-2.5}} = 0.56$ V. Substituting into
(4.33) we have $(V_{GS1} - V_t) = 0.17$ V. Now with $P = 10$ mW, $V_{dd} = 3.3$ V, we
have $I = 3$ mA. Substituting half of this value of current into the square law current
equation, with the given $k' = 100\ \mu A/V^2$, we get $(W/L)_1 = 1038$.
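
The design flow of this example, from IIP3 to $A_{IP3}$ to overdrive voltage to $W/L$, is sketched below. The function name and argument names are hypothetical, and the 50 Ω reference resistance is the one used in the example. The script does not round intermediate values (0.56 V, 0.17 V, 3 mA), so its $W/L$ comes out a little lower than the hand-calculated figure.

```python
import math

def design_vi_converter(iip3_dbm, power_w, v_dd, k_prime, r_ref=50.0):
    """Size the V-I converter transistor from an IIP3 and power budget."""
    p_ip3 = 1e-3 * 10.0 ** (iip3_dbm / 10.0)      # dBm to watts
    a_ip3 = math.sqrt(2.0 * r_ref * p_ip3)        # peak amplitude across r_ref
    v_ov = a_ip3 / math.sqrt(32.0 / 3.0)          # (4.33) solved for V_GS1 - V_t
    i_ss = power_w / v_dd                         # total bias current
    i_d = i_ss / 2.0                              # each side carries half
    # Square law: I_D = (k'/2) (W/L) (V_GS1 - V_t)^2
    w_over_l = 2.0 * i_d / (k_prime * v_ov ** 2)
    return a_ip3, v_ov, i_ss, w_over_l

a_ip3, v_ov, i_ss, w_over_l = design_vi_converter(
    iip3_dbm=5.0, power_w=10e-3, v_dd=3.3, k_prime=100e-6)
print("A_IP3 = %.2f V, V_ov = %.3f V, I = %.2f mA, W/L = %.0f"
      % (a_ip3, v_ov, i_ss * 1e3, w_over_l))
```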


4.6     Distortion,  High-Frequency  Case 

In this section we look at the distortion behaviour of a mixer at high frequency.
To reiterate, for the bulk of this section we assume that $V_{lo}$ is not switching and so
the mixer is an NLTI system. Returning to the Gilbert mixer in Figure 4.6, at high
enough frequency (e.g., 1.9 GHz), the assumption that there is no memory effect
due to capacitors (such as junction capacitors $C_{sb}$, $C_{db}$) and inductors present in
the mixer is no longer correct. These capacitors can be nonlinear capacitors,
which of course introduce their own nonlinear effect. But even if they are linear
capacitors (such as $C_{gd}$, $C_{gs}$), their presence renders the calculation of distortion
due to nonlinear, memoryless components (such as that due to the square law I-V
characteristics) not amenable to simple Taylor series analysis, as described in
section 4.5. Specifically, because of memory effect we will see that (4.20),
repeated here,

$$i_{rfn}^+ = a_1 V_{rf} + a_2 V_{rf}^2 + a_3 V_{rf}^3 + \cdots$$

is no longer correct.

To see why that is so, let us change our focus from a specific nonlinear
circuit with memory, the Gilbert mixer, to a general discussion of nonlinear
systems with memory. We start off our discussion by reverting, for the time being,
to a linear memoryless system, with input $x$ and output $y$. Using the same symbol $a_1$,
which denotes linear gain in (4.20), to denote the gain of this linear system,
we have

$$y = a_1 x \qquad (4.34)$$

From  (4.34)  we  can  see  that  y,  the  output  at  a  particular  instant,  depends  on  x,  the 
input  at  that  particular  instant  only,  and  not  on  inputs  at  any  other  instant.  From  a 
circuit  point  of  view,  this  will  only  be  true  if  the  circuit  contains  no  memory 
(a resistor will be an example). As soon as the circuit has memory (e.g., a capacitor),
then even for this simple linear system the output at a particular instant depends on
all the past input values. This output is then obtained by summing the effects of
all these past inputs. It is easier to see this in the discrete time domain, and so y,
incorporating memory effect, is first written in the discrete time domain:

$$y(n) = \sum_{\tau=-\infty}^{n} h(\tau)\, x(n-\tau) \tag{4.35}$$

where n is the time index and $h(\tau)$ is the weight that signifies how each past input
sample affects the present output sample; $h(\tau)$ is, of course, simply the familiar
impulse response. The sum in (4.35) is the familiar convolution sum. Next we write
y, incorporating memory effect, in the continuous time domain. The sum in (4.35)
becomes an integral, the familiar convolution integral, and (4.35) becomes

$$y(t) = \int_{-\infty}^{\infty} h(\tau)\, x(t-\tau)\, d\tau \tag{4.36}$$

where $h(\tau)$ is the impulse response. To summarize, because of memory effect, we
use (4.36), rather than (4.34), as the input-output relationship of a linear system.
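The difference between (4.34) and the convolution sum can be made concrete with a few lines of Python. This is only a minimal sketch: the function names, the tap values, and the assumption that both $h(\tau)$ and the input are zero for negative indices are illustrative choices, not taken from the text.

```python
def memoryless(a1, x):
    """Eq. (4.34): each output sample depends only on the same-instant input."""
    return [a1 * xn for xn in x]


def convolve_causal(h, x):
    """Convolution sum y(n) = sum over tau of h(tau) * x(n - tau).

    Assumption for this sketch: h(tau) = 0 for tau < 0 and x(n) = 0 for n < 0.
    """
    y = []
    for n in range(len(x)):
        acc = 0.0
        for tau in range(min(n, len(h) - 1) + 1):
            acc += h[tau] * x[n - tau]
        y.append(acc)
    return y


x = [1.0, 0.0, 0.0, 2.0]
h_resistor = [3.0]             # no memory: the impulse response is a single tap
h_rc = [0.5, 0.25, 0.125]      # hypothetical decaying impulse response

y_mem_free = convolve_causal(h_resistor, x)   # identical to memoryless(3.0, x)
y_with_mem = convolve_causal(h_rc, x)         # the first input sample "rings" for three samples
```

With a single-tap impulse response the convolution collapses to the memoryless gain; as soon as more taps are present, the impulse at n = 0 keeps contributing to later outputs.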

The  incorporation  of  memory  effect  can  be  extended  to  a  nonlinear  system.  As 
an  example,  we  start  off  with  a  system  such  as  the  one  described  in  (4.20),  which 
describes  the  input-output  relationship  of  a  nonlinear  system  that  is  memoryless. 
To  incorporate  memory  in  this  nonlinear  system,  every  term  in  (4.20)  should  be 
replaced by an integral, in much the same way that (4.34) becomes (4.36).
Consequently, (4.20) is no longer correct.


4.6.1     Volterra  Series 

We will now develop a general theory that allows us to calculate high-frequency
distortion for an NLTI system, including memory effect. This is the theory of
Volterra series. When applied to circuits it shows that the high-frequency effect
can easily degrade the distortion performance by close to 100% relative to the
prediction of a low-frequency analysis (see Numerical example 4.4). Also, when applied to
fully differential (balanced) circuits with no mismatch, it reveals the surprising
result that there can be second harmonic distortion (HD2) as high as -32 dB (see
Numerical example 4.4), whereas the low-frequency analysis predicts that the HD2 should
be zero. This HD2 would have come from a circuit with an equivalent mismatch as
high as 2.5%, and it can be a major concern because it means RF feedthrough will still
be present in a balanced mixer with zero mismatch. In addition to providing the tools
necessary for high-frequency, low-distortion analysis and design, the Volterra series also
helps in verifying the design. This is because a significant component of the
verification involves simulations of the circuit. To properly interpret the results
produced by the simulator, the analysis tool used in the simulator should be well
understood. The use of Volterra series for high-frequency distortion in mixers has
been adopted in many popular simulators (e.g., SpectreRF [6]). This should help
motivate the interest of the readers in the following sections which, at first glance,
may appear a little mathematical.

Fig. 4.9 Block diagram of a discrete time LTI system

To further motivate the readers regarding the usefulness of Volterra series, we
can take comfort in the fact that the theory can be applied to high-frequency
distortion analysis of any NLTI system. In this book this includes mixers in Chapters
4 and 5, the LNA in Chapter 3, the power amplifier in Chapter 9, and the BPF in Chapter 2.


4.6.1.1     Introduction 

We   introduce   the   concept  of  Volterra   series   by   going   through   systems   of 
increasing  complexity. 


Case 1a: Linear, discrete


We start off by going back to the simplest system: a linear, discrete time system as
shown in Figure 4.9.


1. First assume $h(\tau) = 1$ for all $\tau$. Then we have:

$$y_n = x_n + x_{n-1} + \ldots \tag{4.37}$$

2. In general $h(\tau)$ is a function of $\tau$, the time index, and the output is a convolution
sum:

$$y(n) = \sum_{\tau} h(\tau)\, x(n-\tau) \tag{4.38}$$


Case 1b: Linear, continuous

We now evolve to the continuous time system, where the sum changes to an
integral.

$$y(t) = \int_{-\infty}^{\infty} h(\tau)\, x(t-\tau)\, d\tau \tag{4.39}$$
Fig. 4.10 Block diagram of a discrete time, NLTI system


Case 2a: Nonlinear, Discrete

Next we introduce a mild nonlinearity, a bilinear nonlinearity. We again start off in
the discrete time case. Hence $f(x_n) = x_n^2$. The resulting system is shown in
Figure 4.10:

1. If we assume $h(\tau_1, \tau_2) = 1$ for all $(\tau_1, \tau_2)$, then

$$\begin{aligned}
y_n ={}& x_n \cdot x_n \\
&+ x_n \cdot x_{n-1} + x_{n-1} \cdot x_{n-1} + \ldots \\
&+ x_n \cdot x_{n-2} + x_{n-1} \cdot x_{n-2} + x_{n-2} \cdot x_{n-2} + \ldots \\
&\;\;\vdots \\
&+ x_n \cdot x_0 + x_{n-1} \cdot x_0 + x_{n-2} \cdot x_0 + \ldots + x_0 \cdot x_0 \\
={}& \sum_{j=0}^{n} x_n x_j + \sum_{j=0}^{n-1} x_{n-1} x_j + \ldots
\end{aligned} \tag{4.40}$$

Notice that this simple bilinear nonlinearity has changed the output from being a
simple single sum as described in (4.37) to that of a double sum as shown in
(4.40).
2. In general, $h(\tau_1, \tau_2)$ is not 1 but is a function of the time indexes $(\tau_1, \tau_2)$, as in Case
1a (2). Recall that there this generalization turned the simple single sum of (4.37) into the
weighted single sum of (4.38). Likewise, (4.40) will become a
weighted double sum, which is the convolution sum of this bilinear discrete
system and is given as

$$y(n) = \sum_{\tau_1} \sum_{\tau_2} h(\tau_1, \tau_2)\, x(n-\tau_1)\, x(n-\tau_2) \tag{4.41}$$

Because of the double summation, h is now a function of two time indexes: $\tau_1$, $\tau_2$.
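The weighted double sum of (4.41) can be evaluated directly. The sketch below is illustrative only: the kernel values are hypothetical, the input is assumed to be zero before n = 0, and the separable kernel $h(\tau_1, \tau_2) = h(\tau_1) h(\tau_2)$ is chosen simply because its result can be cross-checked by filtering with $h$ and then squaring.

```python
def first_order_output(h, x, n):
    """Weighted single sum, as in (4.38), at time index n (x(m) = 0 for m < 0)."""
    return sum(h[t] * x[n - t] for t in range(len(h)) if n - t >= 0)


def second_order_output(h2, x, n):
    """Weighted double sum, as in (4.41), at time index n (x(m) = 0 for m < 0)."""
    total = 0.0
    for t1 in range(len(h2)):
        for t2 in range(len(h2[t1])):
            if n - t1 >= 0 and n - t2 >= 0:
                total += h2[t1][t2] * x[n - t1] * x[n - t2]
    return total


h = [1.0, 0.5]                               # hypothetical one-dimensional weights
h2 = [[a * b for b in h] for a in h]         # separable two-index kernel h(t1, t2)
x = [1.0, 2.0, -1.0, 0.5]

y_double_sum = [second_order_output(h2, x, n) for n in range(len(x))]
y_filter_then_square = [first_order_output(h, x, n) ** 2 for n in range(len(x))]
```

For this particular (separable) kernel the two lists agree sample by sample; a general $h(\tau_1, \tau_2)$ does not factor this way, which is why the kernel has to carry two time indexes.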
Case 2b: Nonlinear, Continuous

Similar to the evolution of Case 1a (2) to Case 1b, Case 2a (2) can also evolve to
the continuous time system, where the sum changes to an integral. Equation (4.41)
becomes

$$y(t) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} h(\tau_1, \tau_2)\, x(t-\tau_1)\, x(t-\tau_2)\, d\tau_1\, d\tau_2 \tag{4.42}$$

It takes only a little more imagination to see that, in general, for an nth-order
nonlinearity described by $y = x^n$, equation (4.42) generalizes to (4.43) below.
(Notice that this n describes the order of the nonlinearity and is not to be confused
with the symbol n used to describe the time index in a discrete time system. We reuse
this symbol to avoid introducing a new one, and also because from now on we will be
dealing exclusively with continuous time systems, so no confusion should arise.)

$$y(t) = \int_{-\infty}^{\infty} h_1(\tau_1)\, x(t-\tau_1)\, d\tau_1 + \ldots + \int_{-\infty}^{\infty}\!\!\cdots\!\int_{-\infty}^{\infty} h_n(\tau_1, \tau_2, \ldots, \tau_n)\, x(t-\tau_1)\, x(t-\tau_2) \cdots x(t-\tau_n)\, d\tau_1 \ldots d\tau_n \tag{4.43}$$

in which, for n = 1, 2, ...,

$$h_n(\tau_1, \ldots, \tau_n) = 0 \quad \text{for any } \tau_j < 0,\; j = 1, \ldots, n$$

Notice that the first term is similar to the convolution integral for a linear system.
Equation (4.43) is the Volterra series expansion of an nth-order nonlinear system.


4.6.1.2     Comparisons  with  Taylor  Series 

At this point it may be illustrative to compare the expansion forms of the Taylor series
and the Volterra series. Let us repeat the Taylor series expansion from (4.20), but
using y and x as the output and input variables instead.

$$y = a_1 x + a_2 x^2 + \ldots$$

We can interpret this form of Taylor series expansion by considering $x, x^2, \ldots$ as
the basis functions. Similarly we can repeat the Volterra series expansion from
(4.43) above:

$$\begin{aligned}
y(t) &= \int h_1(\tau_1)\, x(t-\tau_1)\, d\tau_1 + \ldots + \int\!\cdots\!\int h_n(\tau_1, \ldots, \tau_n)\, x(t-\tau_1) \cdots x(t-\tau_n)\, d\tau_1 \ldots d\tau_n \\
&= H_1[x(t)] + H_2[x(t)] + \ldots + H_n[x(t)]
\end{aligned} \tag{4.44}$$

Drawing the analogy with the interpretation of Taylor series expansion, we can
interpret the different orders of integrals $\int, \iint, \iiint, \ldots$ (single integrals, double
integrals, etc.) as the basis functions again. They can be viewed as operators, so that
$\int, \iint, \iiint, \ldots$ are replaced by operators $H_1, H_2, H_3, \ldots$ and so on.


4.6.1.3     Properties  of  a  Bilinear  System 

We continue our investigation of the Volterra series by again considering the
simplest NLTI system, the bilinear system. One property of the bilinear system
differs from the conventional LTI system. For an LTI system, an input sinusoid
consisting of one frequency produces an output sinusoid consisting of only that one
frequency. For the bilinear system, a single input frequency will
generate two output frequencies.

To see this, first let us reiterate, for a bilinear system,

$$y(t_1, t_2) = \iint h_2(t_1 - \tau_1, t_2 - \tau_2)\, x(\tau_1)\, x(\tau_2)\, d\tau_1\, d\tau_2$$

Taking the Fourier transform,

$$Y(j\omega_1, j\omega_2) = H_2(j\omega_1, j\omega_2)\, X(j\omega_1)\, X(j\omega_2) \tag{4.45}$$

where $H_2$ is the 2-dim Fourier transform of the impulse response $h_2(\tau_1, \tau_2)$. The 1-dim
transform then is

$$Y(j\omega) = \frac{1}{2\pi} \int Y(j\omega_1, j\omega_2)\, d\omega_1 = \frac{1}{2\pi} \int Y(j\omega_1, j\omega - j\omega_1)\, d\omega_1$$


Next  we  go  through  different  cases  corresponding  to  different  inputs. 


Case 1: One Input Exponent
For an input of $e^{j\omega_0 t}$, the output is

$$Y(j\omega) = \frac{1}{2\pi} \int H_2(\omega_1, \omega - \omega_1)\, \delta(\omega_1 - \omega_0)\, \delta(\omega - \omega_1 - \omega_0)\, d\omega_1 \tag{4.46}$$

Then, upon integration,

$$Y(j\omega) = \frac{1}{2\pi} H_2(\omega - \omega_0, \omega_0)\, \delta(\omega - 2\omega_0) \;\rightarrow\; e^{j2\omega_0 t} \tag{4.47}$$

The exponential term simply means that in the time domain the output will be a
sinusoid with frequency at $2\omega_0$. Hence, as stated previously, a single input
frequency generates two output frequencies, with one being two times the other.
The concept of transfer function is still valid and is now $H_2(\omega_0, \omega_0)$. Furthermore,
this transfer function is unique and does not depend on initial condition or signal.


Case 2: Two Input Exponents

Next we apply two exponents. Their Fourier transform becomes

$$X(j\omega) = \delta(\omega - \omega_1) + \delta(\omega - \omega_2) \tag{4.48}$$

Then

$$\begin{aligned}
Y(j\omega) ={}& \frac{1}{2\pi} \left[ H_2(\omega - \omega_1, \omega_1)\, \delta(\omega - 2\omega_1) \right] \\
&+ \frac{1}{2\pi} H_2(\omega - \omega_2, \omega_2)\, \delta(\omega - 2\omega_2) \\
&+ \frac{1}{2\pi} H_2(\omega - \omega_2, \omega_2)\, \delta(\omega - \omega_1 - \omega_2)
\end{aligned} \tag{4.49}$$

Notice that as opposed to the one-exponent case, a cross term appears. Therefore,
the principle of superposition, which is fundamental to an LTI system, does not
apply. Specifically, in an LTI system if the input consists of two frequency
components, $\omega_1$, $\omega_2$, the output consists of only the same two frequency components
$\omega_1$, $\omega_2$. In the case of a bilinear system, the output consists not only of these two
frequencies (each doubled, i.e., $2\omega_1$, $2\omega_2$), but also the cross term $\omega_1 + \omega_2$.
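The appearance of the cross term can be checked numerically. The following is a minimal sketch, assuming the simplest possible bilinear system (a memoryless squarer, so the kernel is constant) and tone frequencies placed on hypothetical integer DFT bins `k1` and `k2` so that the spectrum can be read off exactly.

```python
import cmath

N = 64            # samples in the analysis window (assumed)
k1, k2 = 3, 5     # the two input frequencies, expressed as DFT bins (assumed)


def tone(k, n):
    """Sample n of the complex exponent at bin k."""
    return cmath.exp(2j * cmath.pi * k * n / N)


# Two input exponents applied to a memoryless squarer.
x = [tone(k1, n) + tone(k2, n) for n in range(N)]
y = [xn * xn for xn in x]


def dft_bin(sig, k):
    """Complex amplitude of the bin-k exponent contained in sig."""
    return sum(s * tone(-k, n) for n, s in enumerate(sig)) / N


spectrum = {k: abs(dft_bin(y, k)) for k in range(N)}
present = sorted(k for k, a in spectrum.items() if a > 1e-9)
amplitudes = [round(spectrum[k], 6) for k in present]

print(present)      # [6, 8, 10]: 2*k1, k1 + k2, 2*k2
print(amplitudes)   # [1.0, 2.0, 1.0]
```

Neither input bin (3 or 5) survives at the output; only the doubled frequencies and the sum frequency appear. In this constant-kernel sketch the cross term has twice the amplitude of the others because both orderings of the pair contribute to it.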


4.6.1.4     Circuit  Representation  1  of  an  NLTI  System 

In  order  to  work  with  circuits  rather  than  mathematical  models,  we  want  to  develop 
circuit  representation  for  an  NLTI  system.  First  it  can  be  seen  that  the  preceding 
general  description,  where  the  nonlinearity  and  memory  part  of  the  circuit  are  not 
distinctly  separated,  makes  the  circuit  representation  complicated,  even  for  a  simple 
bilinear system. We can simplify this situation if the NLTI system allows us to lump
parts that are nonlinear (but memoryless) together and separate them from the parts
that have memory (but are linear). Such a representation [1] is shown in
Figure 4.11.

Fig. 4.11 Circuit representation 1 (a resistive nonlinearity, nonlinear but memoryless, followed by a block with memory but linear)

Fig. 4.12 A circuit that cannot be represented by circuit representation 1 (a nonlinear resistor with a capacitor C)

The circuit shown in Figure 4.11, which is the circuit representation for one such
class of NLTI system, is driven by an input signal $S_i$. This is applied to a
nonlinearity and develops an intermediate signal $S_o$. This intermediate signal is then applied
to the part of the circuit that has memory but is linear, and develops a final output
signal $S_{ox}$. This second operation, which involves a linear circuit with memory, is
equivalent to a filtering operation. Simple and innocent as this circuit representation
may be, it is rather easy to take an existing circuit and wrongly assume that the
circuit can be represented as in Figure 4.11. The circuit shown in Figure 4.12 is one
such wrong example.

On first encounter, we may think that Figure 4.12, which consists of a nonlinear
resistor (nonlinear but memoryless) driving a linear capacitor (linear but with
memory), can be described by circuit representation 1. We would probably further make
the naive assumption that we can then describe this circuit by writing the following
convolution integral that relates output to input: $S_o = \int h(t - \tau)\, S_i(\tau)\, d\tau$ with
$h(t - \tau) = e^{-(t-\tau)/RC}$, where R is replaced by a nonlinear R, and that we would then have
completely solved the problem. This is WRONG. It turns out that in the preceding
case the nonlinear resistor R in Figure 4.12 is part of the $X(j\omega)$ block in Figure 4.11,
and so this simple-looking circuit cannot be described by circuit representation 1.
Instead we have to invoke the full Volterra series expression from (4.43). This also
highlights the numerous pitfalls that we can easily fall prey to when drawing
conclusions about anything that involves nonlinearity, even though the system may
look rather simple. For the sake of completeness, if we have a circuit as described in
Figure 4.13 (where R is nonlinear), provided that the buffer is linear and memoryless,
then it can be described by circuit representation 1.

Fig. 4.13 A circuit that can be represented by circuit representation 1 (a buffer separates the nonlinear resistor from the capacitor)

Having highlighted the potential problem of using circuit representation 1 to
model an NLTI system, let us point out its main advantage, namely, the simplicity it
offers  in  representing  the  NLTI  system.  Here  we  can  simply  apply  Taylor  series 
representation  to  the  first  block  to  generate  the  intermediate  variable.  Then  we  filter 
the  intermediate  variable  to  produce  the  final  output.  To  recognize  the  power  of  this 
two-step  approach,  let  us  use  it  to  help  illustrate  one  important  distinction  between 
nonlinear  systems  with  and  without  memory. 

Let us assume the resistive nonlinearity is represented by $S_o = a_1 S_i + a_2 S_i^2 + \ldots$
and that the memory element $X(j\omega)$ has the magnitude and phase frequency
response shown in Figure 4.14 [1]. Let us assume that $S_i$ consists of two
input sinusoids at $\omega_1$, $\omega_2$ as shown in Figure 4.14. At the output, $S_o$ has an HD3 and
an IM3 component as shown in Figure 4.14. So far this is a memoryless system, and
from (2.35) IM3 = 3HD3. Hence IM3 is 3 times HD3 as shown. However, upon
filtering, since $X(j\omega)$ has a nonconstant frequency response, the HD3 component is
substantially attenuated (filtered out), so that at the output $S_{ox}$ this 3 times ratio is
no longer true. If we look at the whole block in Figure 4.11 as a nonlinear block
with memory, it is obvious that IM3 is not 3 times HD3. On the other hand, if we
had analyzed the whole block by ignoring the memory, then the IM3 = 3HD3
conclusion would appear. In general, circuit representation 1 helps us gain insight
into other aspects of the NLTI system as well.
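The two-step view (Taylor series first, filter second) is easy to reproduce numerically. The sketch below is illustrative only: the coefficients `a1` and `a3`, the tone bins, and the first-order low-pass used for $X(j\omega)$ are all assumed values, not the response of Figure 4.14. Because the filter is linear, each spectral component of the intermediate signal is simply scaled by $|X|$ at its own frequency.

```python
import cmath
import math

N = 1024               # samples in one analysis window (assumed)
k1, k2 = 50, 53        # the two input tones, as DFT bins (assumed)
a1, a3 = 1.0, 0.1      # hypothetical Taylor coefficients of the nonlinearity
S = 1.0                # amplitude of each input tone
kc = 60                # corner of a hypothetical first-order low-pass, in bins


def X(k):
    """Frequency response of the assumed linear block with memory."""
    return 1.0 / (1.0 + 1j * k / kc)


def amplitude(sig, k):
    """Peak amplitude of the cosine component of sig at bin k."""
    acc = sum(v * cmath.exp(-2j * cmath.pi * k * n / N) for n, v in enumerate(sig))
    return 2.0 * abs(acc) / N


# Step 1: memoryless nonlinearity (intermediate signal).
s_i = [S * (math.cos(2 * math.pi * k1 * n / N) + math.cos(2 * math.pi * k2 * n / N))
       for n in range(N)]
s_o = [a1 * v + a3 * v ** 3 for v in s_i]

hd3 = amplitude(s_o, 3 * k1)          # third harmonic component
im3 = amplitude(s_o, 2 * k1 - k2)     # third-order intermodulation component
ratio_before = im3 / hd3

# Step 2: filtering scales each component by |X| at that component's frequency.
ratio_after = (im3 * abs(X(2 * k1 - k2))) / (hd3 * abs(X(3 * k1)))

print(round(ratio_before, 6), round(ratio_after, 3))
```

Before the filter the ratio is 3, as expected for a memoryless system. After the assumed low-pass the third harmonic, which sits far above the corner, is attenuated much more than the intermodulation product near the fundamentals, so the ratio grows well beyond 3.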


4.6.1.5     Representation  of  a  Bilinear  System  Using  Circuit  Representation  1 

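Case 1: Two Input Sinusoids

Now let us use circuit representation 1 to represent a bilinear system and derive the
output for the case with two input sinusoids.
Referring to Figure 4.11, let us apply

$$S_i = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t$$

Fig. 4.14 Magnitude and phase response of $X(j\omega)$


Because it is a bilinear system we have

$$\begin{aligned}
S_o ={}& a_1 S_1 \cos \omega_1 t + a_1 S_2 \cos \omega_2 t \\
&+ \frac{a_2}{2} \left[ S_1^2 \cos((\omega_1 \pm \omega_1)t) + 2 S_1 S_2 \cos((\omega_1 \pm \omega_2)t) + S_2^2 \cos((\omega_2 \pm \omega_2)t) \right] \\
&+ \frac{a_3}{4} \left[ S_1^3 \cos((\omega_1 + \omega_1 + \omega_1)t) + \ldots \right] + \ldots
\end{aligned} \tag{4.50}$$

Due to filtering:

$$\begin{aligned}
S_{ox} ={}& a_1 S_1 |X(j\omega_1)| \cos(\omega_1 t + \angle X(j\omega_1)) + a_1 S_2 |X(j\omega_2)| \cos(\omega_2 t + \angle X(j\omega_2)) \\
&+ \frac{a_2}{2} \left[ S_1^2 |X(j\omega_1 + j\omega_1)| \cos((\omega_1 + \omega_1)t + \angle X(j2\omega_1)) + S_1^2 |X(j\omega_1 - j\omega_1)| \ldots \right] \\
&+ \frac{a_2}{2} \left[ 2 S_1 S_2 |X(j\omega_1 \pm j\omega_2)| \cos((\omega_1 \pm \omega_2)t + \angle X(j(\omega_1 \pm \omega_2))) \right] + \ldots \\
&+ \frac{a_3}{4} \left[ S_1^3 |X(j\omega_1 + j\omega_1 + j\omega_1)| \cos((\omega_1 + \omega_1 + \omega_1)t + \angle X(j3\omega_1)) + \ldots \right] + \ldots
\end{aligned} \tag{4.51}$$

Notice that this derivation generates seven frequency components that can be
summarized by $\pm\omega_a \pm \omega_b$, where a, b can be equal or different and we have both
the + and - combinations.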
Case 2: n Input Sinusoids

Next we discuss the general case where we have n input sinusoids. Here the output
$S_{ox}$ will be

$$S_{ox} = a_1 S_1 |X(j\omega_1)| \cos(\ldots) + \ldots + a_2 S_1^2 |X(j\omega_1 \pm j\omega_1)| \cos(\ldots) + \ldots \tag{4.52}$$

To simplify writing out all the long expressions, let us use the short-hand
notation [1]

$$S_{ox} = a_1 X(j\omega_a) \circ S_i + a_2 X(j\omega_a, j\omega_b) \circ S_i^2 + \ldots \tag{4.53}$$

Here $S_i$ is a voltage or current in the time domain. The operator $\circ$ means that, once you
select the proper a, b, c, ..., you multiply each frequency component in $S_i^n$ by $|X(j\omega_a, \ldots)|$ and
shift its phase by $\angle X(j\omega_a, j\omega_b, \ldots)$, and do this for all n components: a, b, c, .... Notice
that both +a, -a, +b, -b, +c, -c, ... appear. Also notice that in doing the permutations,
you should include the cases a = b, b = c, a = b = c, and so on. You can, of
course, combine the overlapping terms.

We now go through a few examples to illustrate this short-hand notation. In the
first example, we pick $X(j\omega_a, j\omega_b) \circ S_i^2$:

$X(j\omega_a, j\omega_b) \circ S_i^2 = X(j\omega_a, j\omega_b) \circ S_a S_b$ represents the following terms:

$$\begin{aligned}
&|X(j\omega_1, j\omega_1)|\, S_i^2\, \angle X(j\omega_1, j\omega_1) \\
&|X(-j\omega_1, -j\omega_1)|\, S_i^2\, \angle X(-j\omega_1, -j\omega_1) \\
&|X(j\omega_1, -j\omega_2)|\, S_i^2\, \angle X(j\omega_1, -j\omega_2) \\
&|X(j\omega_2, j\omega_2)|\, S_i^2\, \angle X(j\omega_2, j\omega_2) \\
&|X(-j\omega_2, -j\omega_2)|\, S_i^2\, \angle X(-j\omega_2, -j\omega_2) \\
&|X(-j\omega_1, j\omega_2)|\, S_i^2\, \angle X(-j\omega_1, j\omega_2) \\
&|X(j\omega_1, -j\omega_1)|\, S_i^2\, \angle X(j\omega_1, -j\omega_1)
\end{aligned} \tag{4.54}$$

First let us pick a = 1, b = 1, or the first term in (4.54). From
$S_i = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t$, we are only concerned with $S_1 \cos \omega_1 t$. But since it is
multiplied by $S_i$ (that is, the square term), we have to modify $S_1 \cos \omega_1 t$ to
$S_1^2 \cos(\omega_1 + \omega_1)t$, and so $|X(j\omega_1, j\omega_1)|\, S_i^2\, \angle X(j\omega_1, j\omega_1)$ becomes
$|X(j\omega_1 + j\omega_1)|\, S_1^2 \cos(\omega_1 + \omega_1)t\, \angle X(j\omega_1 + j\omega_1)$, which equals
$|X(j\omega_1 + j\omega_1)|\, S_1^2 \cos((\omega_1 + \omega_1)t + \angle X(j2\omega_1))$.
This agrees with the first term under the first $\frac{a_2}{2}$ expression of (4.51).

Next let us pick a = 2, b = 2; then we have $|X(j\omega_2, j\omega_2)|\, S_i^2\, \angle X(j\omega_2, j\omega_2)$,
which is the fourth term in (4.54). Repeating the foregoing, this time from
$S_i = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t$ we are only concerned with $S_2 \cos \omega_2 t$. But since it is
multiplied by $S_i$ (that is, the square term), we have to modify $S_2 \cos \omega_2 t$
to $S_2^2 \cos(\omega_2 + \omega_2)t$, and so finally $|X(j\omega_2, j\omega_2)|\, S_i^2\, \angle X(j\omega_2, j\omega_2)$ becomes
$|X(j\omega_2 + j\omega_2)|\, S_2^2 \cos(\omega_2 + \omega_2)t\, \angle X(j\omega_2 + j\omega_2)$, which equals
$|X(j\omega_2 + j\omega_2)|\, S_2^2 \cos((\omega_2 + \omega_2)t + \angle X(j2\omega_2))$.

Let us go to the third term: $|X(j\omega_1, -j\omega_2)|\, S_i^2\, \angle X(j\omega_1, -j\omega_2)$. To get this term we
would have picked a = 1, b = -2, and so we stick to this choice. According to the
explanation of the short-hand notation, this means that we multiply each frequency
component in $S_i^2$ by $|X(j\omega_1 - j\omega_2)|$ and shift its phase by $\angle X(j\omega_1 - j\omega_2)$ (notice that
the comma becomes a minus sign, because we have picked (+1, -2)).
Since $S_i = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t$, there are two frequency components, and we
should multiply each frequency component in $S_i$ by $|X(j\omega_1 - j\omega_2)|$ and shift by
$\angle X(j\omega_1 - j\omega_2)$. Let us start from the first component, $S_1 \cos \omega_1 t$. Since it is
multiplied by $S_i$ (that is, the square term), we have to modify $S_1 \cos \omega_1 t$ to
$S_1 S_2 \cos(\omega_1 - \omega_2)t$. Notice that we have $S_1 S_2$ as opposed to $S_1 S_1 = S_1^2$ because a
= 1 and b = -2 this time. Hence the first term becomes
$|X(j\omega_1 - j\omega_2)|\, S_1 S_2 \cos((\omega_1 - \omega_2)t + \angle X(j\omega_1 - j\omega_2))$. This agrees with the first term under the
second $\frac{a_2}{2}$ expression of (4.51).

If  we  continue  with  this  practice  on  all  the  seven  terms  in  (4.54),  we  will 
generate  all  the  terms  represented  by  the  short-hand  notation.  Of  course,  we  have 
to  eliminate  any  overlapping  terms.  The  final  expression  will  be  identical  to  the 
combination  of  all  the  second-order  terms  in  (4.51). 

We next go through another example illustrating the short-hand notation. In this
example, suppose we are interested in the third-order term, but we still only have
two frequency components as inputs (i.e., $S_i = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t$). Then a
possible expanded term from $a_3 X(j\omega_a, j\omega_b, j\omega_c) \circ S_i^3$ will be

$$|X(j\omega_1 - j\omega_2 + j\omega_2)|\, S_1 S_2 S_2 \cos((\omega_1 - \omega_2 + \omega_2)t + \angle X(j\omega_1 - j\omega_2 + j\omega_2)) \tag{4.55}$$

For yet another example illustrating the short-hand notation, suppose we
have three frequency components as inputs (i.e., $S_i = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t + S_3 \cos \omega_3 t$),
yet we are interested in the second-order term, $X(j\omega_a, j\omega_b) \circ S_i^2 =
X(j\omega_a, j\omega_b) \circ S_a S_b$. Then, because there are three possibilities, a = 1, b = 2 is not
the only choice. We can have a = 1, b = 2; a = 2, b = 3; a = 3, b = 1. Starting again
with a = 1, b = 2 we have:

$$|X(j\omega_1 + j\omega_2)|\, S_1 S_2 \cos((\omega_1 + \omega_2)t + \angle X(j\omega_1 + j\omega_2)) \tag{4.56}$$

This agrees with the first term under the second $\frac{a_2}{2}$ expression of (4.51). Next we
look at the case when a = 2, b = 3 and we have:

$$|X(j\omega_3 + j\omega_2)|\, S_3 S_2 \cos((\omega_3 + \omega_2)t + \angle X(j\omega_3 + j\omega_2))$$

Finally we look at the case a = 3, b = 1 and we have

$$|X(j\omega_3 + j\omega_1)|\, S_3 S_1 \cos((\omega_3 + \omega_1)t + \angle X(j\omega_3 + j\omega_1))$$
Fig. 4.15 Circuit representation 2

4.6.1.6     Circuit  Representation  2  of  an  NLTI  System  [1] 

To  make  the  preceding  representation  more  general,  we  can  assume  that  the  input 
goes  into  a  linear  element  with  memory  first  (represented  as  a  linear  filter  in  front). 
Figure  4.11  is  redrawn  in  Figure  4.15  with  this  modification. 

As an example of how this system works, let us start with two input sinusoids:

$$S_{iy} = S_1 \cos \omega_1 t + S_2 \cos \omega_2 t$$

First, since $Y(j\omega)$ is linear, superposition applies and

$$S_i = |Y(j\omega_1)|\, S_1 \cos(\omega_1 t + \angle Y(j\omega_1)) + |Y(j\omega_2)|\, S_2 \cos(\omega_2 t + \angle Y(j\omega_2)) \tag{4.57}$$

To be consistent we adopt the short-hand notation that we developed in (4.53)
and we have

$$S_i = Y(j\omega) \circ S_{iy} \tag{4.58}$$

Then, assuming that the nonlinearity has coefficients $a_1$, $a_2$ and so on, we apply $S_i$
to this nonlinearity and we have

$$S_o = a_1 S_i + a_2 S_i^2 + a_3 S_i^3 + \ldots \tag{4.59}$$

Substituting (4.58) into (4.59) we have

$$S_o = a_1 Y(j\omega) \circ S_{iy} + a_2 \left[ Y(j\omega) \circ S_{iy} \right]^2 + a_3 \left[ Y(j\omega) \circ S_{iy} \right]^3 + \ldots$$

If we expand this we will find that the second-order term consists of

$$\begin{aligned}
&\frac{a_2}{2} \left[ |Y(j\omega_1)|^2 S_1^2 \left\{ \cos[(\omega_1 + \omega_1)t + \angle Y(j\omega_1) + \angle Y(j\omega_1)] + \cos[(\omega_1 - \omega_1)t + \angle Y(j\omega_1) - \angle Y(j\omega_1)] \right\} \right] \\
&+ \frac{a_2}{2} \left[ 2 S_1 S_2 |Y(j\omega_1)||Y(j\omega_2)| \left\{ \cos[(\omega_1 + \omega_2)t + \angle Y(j\omega_1) + \angle Y(j\omega_2)] + \cos[(\omega_1 - \omega_2)t + \angle Y(j\omega_1) - \angle Y(j\omega_2)] \right\} \right] \\
&+ \frac{a_2}{2} \left[ |Y(j\omega_2)|^2 S_2^2 \left\{ \cos[(\omega_2 + \omega_2)t + \angle Y(j\omega_2) + \angle Y(j\omega_2)] + \cos[(\omega_2 - \omega_2)t + \angle Y(j\omega_2) - \angle Y(j\omega_2)] \right\} \right]
\end{aligned} \tag{4.60}$$

This is a complicated expression, but fortunately if we use the short-hand
notation it can in turn be written as

$$a_2\, Y(j\omega_a) \cdot Y(j\omega_b) \circ S_{iy}^2$$

with $\omega_a \pm \omega_b$, a = 1, 2, b = 1, 2.

Let us do a quick check here. For the first term, a = 1, b = 1, we have
$a_2 |Y(j\omega_1)||Y(j\omega_1)|\, S_1^2\, (\angle Y(j\omega_1) + \angle Y(j\omega_1))$. We rewrite this term in its complete
form and we have $a_2 |Y(j\omega_1)||Y(j\omega_1)|\, S_1^2 \cos((\omega_1 + \omega_1)t + 2\angle Y(j\omega_1))$.

Looking at (4.60) one can see that under the first $\frac{a_2}{2}$ expression, the first term
agrees with this term. Hence the short-hand notation is consistent.


Interesting  Observation  1 

Note that the output from circuit representation 2 has an important but subtle
difference from the output from circuit representation 1. The phase shift
contribution in the circuit representation 2 case is $2\angle Y(j\omega_1)$. On the other hand, in the
circuit representation 1 case, the phase shift contribution is $\angle X(j2\omega_1)$. Physically
this difference can be explained as follows. In circuit representation 1 we
apply the signal to the nonlinearity first, so a component at $2\omega_1$ is generated.
This component is then filtered by $X(j\omega)$ and therefore picks up a phase shift that must
be calculated at $2\omega_1$, giving a phase shift contribution of $\angle X(j2\omega_1)$. On the other
hand, in circuit representation 2, we do the filtering first. Therefore, the component
remains at $\omega_1$. This filtering operation contributes a phase shift, which should be
calculated at $\omega_1$, giving a phase contribution of $\angle Y(j\omega_1)$. Upon going through the
nonlinearity, two of these signals, each at frequency $\omega_1$ and each contributing a
phase shift of $\angle Y(j\omega_1)$, combine to generate a component at $2\omega_1$.
Therefore, the total phase shift is the sum of these two, or $2\angle Y(j\omega_1)$.


Interesting  Observation  2 

The  linear  filter  and  the  nonlinear  resistive  components  are  not  commutative. 
Referring  to  Figure  4.11  this  means  that  Sox  will  be  different  if  the  two  blocks  are 
swapped. 
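Both observations can be checked with a short numerical experiment. The sketch below is only an illustration under stated assumptions: the nonlinearity is a pure squarer (unit second-order coefficient), the same hypothetical first-order low-pass `H` is used for both $X$ and $Y$, and the input tone sits on an integer DFT bin so that the steady-state filtering can be done exactly in the frequency domain.

```python
import cmath
import math

N = 64      # samples per period (assumed)
k = 4       # input tone, in DFT bins (assumed)
kc = 4      # corner of a hypothetical first-order low-pass, in bins


def H(m):
    """Frequency response used for both X and Y in this sketch."""
    return 1.0 / (1.0 + 1j * m / kc)


def dft_bin(sig, m):
    return sum(v * cmath.exp(-2j * cmath.pi * m * n / N) for n, v in enumerate(sig))


def apply_filter(sig):
    """Steady-state output of the linear block for a periodic real input."""
    shaped = []
    for m in range(N):
        signed_m = m if m <= N // 2 else m - N   # upper bins are negative frequencies
        shaped.append(dft_bin(sig, m) * H(signed_m))
    return [
        sum(c * cmath.exp(2j * cmath.pi * m * n / N) for m, c in enumerate(shaped)).real / N
        for n in range(N)
    ]


def square(sig):
    return [v * v for v in sig]


x = [math.cos(2 * math.pi * k * n / N) for n in range(N)]

rep1 = apply_filter(square(x))   # representation 1: nonlinearity first, then filter
rep2 = square(apply_filter(x))   # representation 2: filter first, then nonlinearity

# Phase and amplitude of the second-harmonic component in each output.
phase1 = cmath.phase(dft_bin(rep1, 2 * k))   # expected: angle of H at 2k
phase2 = cmath.phase(dft_bin(rep2, 2 * k))   # expected: twice the angle of H at k
amp1 = 2 * abs(dft_bin(rep1, 2 * k)) / N
amp2 = 2 * abs(dft_bin(rep2, 2 * k)) / N
```

With these assumed values `phase1` equals the filter phase evaluated at the doubled frequency, while `phase2` equals twice the filter phase at the input frequency, and the two second-harmonic amplitudes also differ. Swapping the order of the two blocks therefore changes the output, as stated in Observation 2.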


4.6.2    Analysis  of  Gilbert  Mixer 

Let us return to our main goal of Section 4.6, which is the derivation of the IIP3 of the
Gilbert mixer at high frequency. Again we assume that at high frequency the
distortion is still dominated by the V-I converter. Hence let us redraw the SCP, this time
including parasitic capacitance.

Fig. 4.16 SCP as V-I converter with parasitic capacitance

First we want to highlight the dominant parasitic capacitors. Typically, the
MOS transistor that implements the current $I_{SS}$ has the largest size (larger than the
input transistors $M_{1-2}$ and the switching transistors $M_{3-6}$), and hence we assume that
its drain to bulk capacitance $C_{db}$ dominates. In parallel with this $C_{db}$ are the source
to bulk capacitances of $M_1$ and $M_2$, and hence we lump all three of them together and
denote the result $C_d$. Again, to simplify the analysis we further assume that these capacitors are
linear, even though in real life $C_d$ is nonlinear. Hence the only nonlinearity comes
from the square law characteristics of the device. It should be noted that, similar to
Figure 4.12, Figure 4.16 cannot be represented by either circuit representation 1
(Figure 4.11) or circuit representation 2 (Figure 4.15), and hence (4.53) or (4.59)
cannot be used to obtain $i_d$. Accordingly, a complete Volterra series analysis must
be applied to Figure 4.16.


4.6.2.1     Summary  of  Steps 

First we present a summary of the steps.

All small-signal terms are in lowercase. We assume that a small-signal
differential input voltage $v_{rf}$ is applied and a small-signal output current $i_d$ is developed.

In order to determine the distortion in $i_d$ with respect to $v_{rf}$ we do the following:

Step 1: Write KCL at the tail node (source node) with node voltage $v_s$ and
determine the Volterra series expansion of the intermediate variable $v_s$ in terms
of the input $v_{rf}$, using the short-hand notation.

$$v_s = H_1 \circ v_{rf} + H_2 \circ v_{rf}^2 + H_3 \circ v_{rf}^3 + \ldots \tag{4.61}$$

Step 2: Use the MOS device equation in the small-signal form:

$$i_d = \frac{k}{2} \left( v_{rf} - v_s \right)^2 \tag{4.62}$$

to express the output current $i_d$ in terms of the input voltage $v_{rf}$ and the intermediate
variable $v_s$.

Step 3: Substitute $v_s$ determined in (4.61) (from step 1) in (4.62) and derive

$$i_d = G_1 \circ v_{rf} + G_2 \circ v_{rf}^2 + G_3 \circ v_{rf}^3 + \ldots \tag{4.63}$$

$G_n$ now becomes a function of $H_n$. Since $H_n$ has been determined in step 1, $G_n$
can now be solved. This calculation will be carried out on each $G_n$ sequentially.

Step 4: Derive IM3, HD3 in terms of $G_n$. Since $G_n$ has been determined in step 3,
IM3, HD3 can now be calculated.

The step-by-step Volterra series derivation of the distortion for the SCP in Figure
4.16 is now presented.


4.6.2.2     Determine  Volterra  Kernel  Hn 

We begin step 1 by applying KCL at the tail node:

$$C_d \frac{dV_s}{dt} + I_{SS} - \frac{k}{2} \left[ (V_{gs1} - V_t)^2 + (V_{gs2} - V_t)^2 \right] = 0 \tag{4.64}$$

Let us define $V_{gs1} = V_{GS} + v_{gs1}$, $V_{gs2} = V_{GS} + v_{gs2}$ and substitute into (4.64).
Then we have

$$C_d \frac{dV_s}{dt} + I_{SS} - \frac{k}{2} \left[ ((V_{GS} + v_{gs1}) - V_t)^2 + ((V_{GS} + v_{gs2}) - V_t)^2 \right] = 0 \tag{4.65}$$

Expanding (4.65), we have

$$C_d \frac{dv_s}{dt} + I_{SS} - \frac{k}{2} \left[ v_{gs1}^2 + 2 v_{gs1}(V_{GS} - V_t) + (V_{GS} - V_t)^2 + v_{gs2}^2 + 2 v_{gs2}(V_{GS} - V_t) + (V_{GS} - V_t)^2 \right] = 0 \tag{4.66}$$

Here $\frac{dV_s}{dt}$ is replaced by $\frac{dv_s}{dt}$, because the dc term in $V_s$, when differentiated with
respect to time, becomes 0. Expressing the input voltage in terms of differential and
common mode voltage,

$$v_{gs1} = \frac{v_{rf}}{2} - v_s, \qquad v_{gs2} = -\frac{v_{rf}}{2} - v_s \tag{4.67}$$

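We can then substitute this into (4.66) and get:

$$\begin{aligned}
C_d \frac{dv_s}{dt} + I_{SS} - \frac{k}{2} \Big[ & \left( \tfrac{v_{rf}}{2} - v_s \right)^2 + 2 \left( \tfrac{v_{rf}}{2} - v_s \right)(V_{GS} - V_t) + (V_{GS} - V_t)^2 \\
& + \left( -\tfrac{v_{rf}}{2} - v_s \right)^2 + 2 \left( -\tfrac{v_{rf}}{2} - v_s \right)(V_{GS} - V_t) + (V_{GS} - V_t)^2 \Big] = 0
\end{aligned} \tag{4.68}$$

Expanding (4.68) we have

$$\begin{aligned}
C_d \frac{dv_s}{dt} + I_{SS} - \frac{k}{2} \Big[ & \tfrac{v_{rf}^2}{4} + v_s^2 - v_{rf} v_s + v_{rf}(V_{GS} - V_t) - 2 v_s (V_{GS} - V_t) + (V_{GS} - V_t)^2 \\
& + \tfrac{v_{rf}^2}{4} + v_s^2 + v_{rf} v_s - v_{rf}(V_{GS} - V_t) - 2 v_s (V_{GS} - V_t) + (V_{GS} - V_t)^2 \Big] = 0
\end{aligned} \tag{4.69}$$

Simplifying (4.69) we have

$$C_d \frac{dv_s}{dt} + I_{SS} - \frac{k}{2} \left[ \frac{v_{rf}^2}{2} + 2 v_s^2 - 4 v_s (V_{GS} - V_t) + 2 (V_{GS} - V_t)^2 \right] = 0 \tag{4.70}$$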
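The algebra leading from (4.68) to (4.70) is easy to get wrong by a sign, so a quick numerical spot check can be reassuring. The sketch below compares only the bracketed terms of the two equations at random operating points; the function names and the variable `v_ov`, which stands for $(V_{GS} - V_t)$, are illustrative.

```python
import random


def bracket_468(v_rf, v_s, v_ov):
    """Bracketed term of (4.68); v_ov stands for (V_GS - V_t)."""
    a = v_rf / 2 - v_s       # v_gs1 from (4.67)
    b = -v_rf / 2 - v_s      # v_gs2 from (4.67)
    return (a ** 2 + 2 * a * v_ov + v_ov ** 2) + (b ** 2 + 2 * b * v_ov + v_ov ** 2)


def bracket_470(v_rf, v_s, v_ov):
    """Bracketed term of (4.70), after the odd-order v_rf terms cancel."""
    return v_rf ** 2 / 2 + 2 * v_s ** 2 - 4 * v_s * v_ov + 2 * v_ov ** 2


random.seed(0)
max_err = 0.0
for _ in range(1000):
    v_rf, v_s, v_ov = (random.uniform(-1.0, 1.0) for _ in range(3))
    err = abs(bracket_468(v_rf, v_s, v_ov) - bracket_470(v_rf, v_s, v_ov))
    max_err = max(max_err, err)

print(max_err)   # rounding-level difference only
```

The terms that are odd in $v_{rf}$ cancel between the two transistors, which is why only $v_{rf}^2$ survives in (4.70).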


Expand $v_s$ as a sum of its linear term, square term, and so on:

$$v_s = v_{s1} + v_{s2} + v_{s3} + \ldots \tag{4.71}$$

Substituting (4.71) into (4.70) and taking the phasor representation, we have

$$C_d\, j\omega\, (v_{s1} + v_{s2} + \ldots) + I_{SS} - \frac{k}{2} \left[ \frac{v_{rf}^2}{2} + 2 (v_{s1} + v_{s2} + \ldots)^2 - 4 (v_{s1} + v_{s2} + \ldots)(V_{GS} - V_t) + 2 (V_{GS} - V_t)^2 \right] = 0 \tag{4.72}$$

Next  we  would  want  to  find  the  linear  term,  square  term,  and  so  on  of  vs 


Linear  Terms 


Let us start from (4.72), keeping only the linear terms in $v_s$ (no dc terms such as $I_{SS}$,
$V_{GS} - V_t$; ignore higher-order terms like $v_{rf}^2$), and we have

$$jC_d\,\omega\,v_{s1} + 2k\,v_{s1}(V_{GS} - V_t) = 0 \quad\therefore\quad v_{s1} = 0 \tag{4.73}$$

However from (4.61) we have

$$v_{s1} = H_1 \circ v_{rf}$$

Hence this means that

$$H_1 = 0 \tag{4.74}$$


Second-Order  Terms 

Keeping only the second-order (or square) terms in (4.72), we have

$$C_d\,j(\omega_1 + \omega_2)v_{s2} - \frac{k}{2}\left[\frac{v_{rf}^2}{2} + 2v_{s1}^2 - 4v_{s2}(V_{GS} - V_t)\right] = 0 \tag{4.75}$$


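Notice that $\omega$ becomes $\omega_1 + \omega_2$ because we are interested in the second-order
terms.

Substituting (4.73) in (4.75), we have

$$j(\omega_1 + \omega_2)C_d\,v_{s2} - \frac{k}{2}\left[\frac{v_{rf}^2}{2} - 4v_{s2}(V_{GS} - V_t)\right] = 0 \tag{4.76}$$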


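Let us repeat (4.61) here:

$$v_s(t) = H_1(\omega) \circ v_{rf} + H_2(\omega_1, \omega_2) \circ v_{rf}^2 + H_3(\omega_1, \omega_2, \omega_3) \circ v_{rf}^3 + \ldots \tag{4.77}$$

Since we are interested only in the second-order terms, we have

$$v_{s2} = H_2 \circ v_{rf}^2 \tag{4.78}$$

Substituting (4.78) into (4.76), we have

$$j(\omega_1 + \omega_2)C_d\,H_2(\omega_1, \omega_2) \circ v_{rf}^2 - \frac{k}{2}\left[\frac{v_{rf}^2}{2} - 4H_2(\omega_1, \omega_2) \circ v_{rf}^2\,(V_{GS} - V_t)\right] = 0 \tag{4.79}$$

Factoring out $H_2$ and $v_{rf}^2$ in (4.79), we have

$$\left[j(\omega_1 + \omega_2)C_d + 2k(V_{GS} - V_t)\right]H_2(\omega_1, \omega_2) \circ v_{rf}^2 = \frac{k}{4}v_{rf}^2$$

$$H_2(\omega_1, \omega_2) = \frac{k/4}{j(\omega_1 + \omega_2)C_d + 2k(V_{GS} - V_t)} \tag{4.80}$$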


Because  of  the  operator  o  ,  (4.80)  actually  consists  of  four  equations  for  four 
different  cases: 

We can do some more simplification by noting that at intermediate frequencies
when

$$j(\omega_1 + \omega_2)C_d \ll 2k(V_{GS} - V_t) \tag{4.81}$$

we can take the Taylor series expansion of (4.80), retain the first two terms only,
and get

$$H_2(\omega_1, \omega_2) = \frac{1}{8(V_{GS} - V_t)}\left(1 - \frac{j(\omega_1 + \omega_2)C_d}{2k(V_{GS} - V_t)}\right) \tag{4.82}$$

For the low-frequency case we can practically set $\omega = 0$, and (4.82) becomes

$$H_2(\omega_1, \omega_2) = \frac{1}{8(V_{GS} - V_t)} \tag{4.83}$$
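The three forms of $H_2$ can be compared numerically. The following is a minimal Python sketch, not part of the original derivation: the function names are hypothetical, `v_ov` stands for $V_{GS} - V_t$, and the component values are borrowed from Numerical example 4.4 purely for illustration.

```python
import math

def h2_exact(w1, w2, k, v_ov, c_d):
    """Second-order kernel H2 of (4.80); v_ov stands for V_GS - V_t."""
    return (k / 4) / (1j * (w1 + w2) * c_d + 2 * k * v_ov)

def h2_intermediate(w1, w2, k, v_ov, c_d):
    """Two-term Taylor form (4.82), valid when (w1 + w2) * c_d << 2 * k * v_ov."""
    x = 1j * (w1 + w2) * c_d / (2 * k * v_ov)
    return (1 - x) / (8 * v_ov)

def h2_low(v_ov):
    """Low-frequency limit (4.83)."""
    return 1 / (8 * v_ov)

# Illustrative values (assumed; taken from Numerical example 4.4)
K = 6250e-6     # A/V^2
V_OV = 0.387    # V
C_D = 100e-15   # F

for f in (1e6, 100e6, 1.9e9):
    w = 2 * math.pi * f
    exact = h2_exact(w, w, K, V_OV, C_D)
    approx = h2_intermediate(w, w, K, V_OV, C_D)
    print(f"{f:.3g} Hz: |exact|={abs(exact):.4f}  |two-term|={abs(approx):.4f}  "
          f"low-f={h2_low(V_OV):.4f}")
```

At low frequency all three values coincide; the gap between the exact and the two-term form widens as $(\omega_1 + \omega_2)C_d$ approaches $2k(V_{GS} - V_t)$.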


Third-Order  Terms 

Let us start from (4.72) and do the following: replace $j\omega$ by $j(\omega_1 + \omega_2 + \omega_3)$,
since we are interested in the third-order terms. Equation (4.72) becomes

$$C_d\,j(\omega_1 + \omega_2 + \omega_3)(v_{s1} + v_{s2} + v_{s3} + \ldots) + I_{SS} - \frac{k}{2}\left[\frac{v_{rf}^2}{2} + 2(v_{s1} + v_{s2} + v_{s3} + \ldots)^2 - 4(v_{s1} + v_{s2} + v_{s3} + \ldots)(V_{GS} - V_t) + 2(V_{GS} - V_t)^2\right] = 0 \tag{4.84}$$

First we want to expand the $2(v_{s1} + v_{s2} + v_{s3} + \ldots)^2$ factor and look for third-order
terms in the expansion:

$$2(v_{s1} + v_{s2} + v_{s3} + \ldots)^2 = 2\left(v_{s1}^2 + v_{s2}^2 + v_{s3}^2 + v_{s4}^2 + \ldots + 2v_{s1}v_{s2} + 2v_{s1}v_{s3} + 2v_{s1}v_{s4} + \ldots + 2v_{s2}v_{s3} + 2v_{s2}v_{s4} + \ldots\right) \tag{4.85}$$

In (4.85), third-order terms come from a single $v_{s3}$ term or a product of a $v_{s1}$ term
and a $v_{s2}$ term. Since the $v_{s3}$ term is inside the bracket, which goes through a square
operation, any terms it generates will be higher than third order. Hence the
only third-order term in this factor is

$$4v_{s1}v_{s2} \tag{4.86}$$

Turning to the rest of the factors in (4.84), it is obvious that there is no third-order term
in the factors $I_{SS}$, $\frac{v_{rf}^2}{2}$, $2(V_{GS} - V_t)^2$. Hence if we keep only the third-order terms,
(4.84) becomes

$$C_d\,j(\omega_1 + \omega_2 + \omega_3)v_{s3} - \frac{k}{2}\left[4v_{s1}v_{s2} - 4v_{s3}(V_{GS} - V_t)\right] = 0 \tag{4.87}$$

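Next we take (4.61), retain the first three terms and write them down as

$$v_{s1} = H_1 \circ v_{rf}, \qquad v_{s2} = H_2 \circ v_{rf}^2, \qquad v_{s3} = H_3 \circ v_{rf}^3 \tag{4.88}$$

Substituting (4.88) into (4.87), we have

$$C_d\,j(\omega_1 + \omega_2 + \omega_3)H_3 \circ v_{rf}^3 - \frac{k}{2}\left[4(H_1 \circ v_{rf})(H_2 \circ v_{rf}^2) - 4H_3 \circ v_{rf}^3\,(V_{GS} - V_t)\right] = 0 \tag{4.89}$$

Factoring out $v_{rf}^3$ and dropping it, we have

$$C_d\,j(\omega_1 + \omega_2 + \omega_3)H_3 - \frac{k}{2}\left[4\overline{H_1H_2} - 4H_3(V_{GS} - V_t)\right] = 0 \tag{4.90}$$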


Here $\overline{H_1H_2}$ is defined as

$$\overline{H_1H_2} = \frac{H_1(\omega_1)H_2(\omega_2, \omega_3) + H_1(\omega_2)H_2(\omega_3, \omega_1) + H_1(\omega_3)H_2(\omega_1, \omega_2)}{3} \tag{4.91}$$

Notice the argument of $H_1$ always consists of one frequency component and that
of $H_2$ always consists of two frequency components. The bar is there to ensure that
all the possible permutations are exercised.

Rearranging (4.90) we have

$$H_3\left[j(\omega_1 + \omega_2 + \omega_3)C_d + 2k(V_{GS} - V_t)\right] = 2k\,\overline{H_1H_2} \tag{4.92}$$

Finally, we solve for $H_3$:

$$H_3(\omega_1, \omega_2, \omega_3) = \frac{2k\,\overline{H_1H_2}}{j(\omega_1 + \omega_2 + \omega_3)C_d + 2k(V_{GS} - V_t)} \tag{4.93}$$

This complicated-looking formula, however, has a simple answer. From (4.74),
$H_1 = 0$, and hence substituting (4.74) into (4.91), we have

$$\overline{H_1H_2} = 0 \tag{4.94}$$

Substituting (4.94) in (4.93), we have

$$H_3 = 0 \tag{4.95}$$

This brings us to the end of step 1.

Summary and Interpretation of Results from Step 1
From step 1 we have determined that

$$v_s = v_{s1} + v_{s2} + v_{s3} + \ldots = H_1 \circ v_{rf} + H_2 \circ v_{rf}^2 + H_3 \circ v_{rf}^3 + \ldots$$

and that

$$H_1 = 0, \qquad H_2(\omega_1, \omega_2) = \frac{k/4}{j(\omega_1 + \omega_2)C_d + 2k(V_{GS} - V_t)}, \qquad H_3 = 0 \tag{4.96}$$

Note $H_1 = 0$ is simply restating the familiar result that if we concentrate on the
linear response of the circuit (or, equivalently, that the circuit is represented as a
linear circuit), then the source of $M_1$ and $M_2$ is an ac ground, which means $v_s = 0$.
This result, of course, is presented in any text that analyzes the SCP.

Another interesting fact is that $H_2 \neq 0$. How do we explain this? Let us neglect
memory effect for the time being. Then any text on SCP will show that the output
current (on either output branch) is an odd function of the input voltage, due to
symmetry. It just follows from the mathematical property of an odd function that
the output current cannot have even harmonics, including the second harmonics.
We may postulate that since the device characteristics have second-order nonlinearity
(due to the square law), for the output current to possess no second harmonics,
the gate to source voltage $v_{gs}$ must possess second harmonics. Since the gate voltage
$v_g = v_{rf}$ has no second harmonics, the source voltage $v_s$ must have second
harmonics to "cancel" the device non-linearity. This implies that $v_{s2} \neq 0$ and
therefore $H_2$ is nonzero.
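The memoryless version of this argument can be checked directly. The sketch below is an illustration under stated assumptions rather than part of the original derivation: it sets $C_d = 0$ in (4.70) and uses the dc bias condition $I_{SS} = k(V_{GS} - V_t)^2$, which reduces the KCL to a quadratic in $v_s$. The function name `tail_voltage` and the value of `V_OV` are hypothetical.

```python
import math

def tail_voltage(v_rf, v_ov):
    """Solve (4.70) with C_d = 0 and I_SS = k * v_ov**2 for the small-signal v_s.

    The KCL then reduces to v_s**2 - 2*v_ov*v_s + v_rf**2/4 = 0; the root
    that vanishes at v_rf = 0 is returned. v_ov stands for V_GS - V_t.
    """
    return v_ov - math.sqrt(v_ov**2 - v_rf**2 / 4)

V_OV = 0.387  # V (illustrative value)

# v_s is an even function of v_rf: it has no linear or cubic part,
# which is the memoryless counterpart of H1 = 0 and H3 = 0.
for v in (0.01, 0.05, 0.1):
    assert math.isclose(tail_voltage(v, V_OV), tail_voltage(-v, V_OV))

# For small v_rf the ratio v_s / v_rf^2 approaches H2 = 1 / (8 * v_ov) of (4.83).
v_small = 1e-3
h2_estimate = tail_voltage(v_small, V_OV) / v_small**2
print(h2_estimate, 1 / (8 * V_OV))
```

The source node therefore moves at twice the input frequency even though the linear analysis treats it as an ac ground.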


4.6.2.3     Relating Volterra Kernel $G_n$ to $H_n$

Now we turn to step 2. Let us refer to Figure 4.16 again and concentrate on
transistor $M_1$.

To reiterate, our goal is to write $i_d$ in the following form:

$$i_d = G_1 \circ v_{rf} + G_2 \circ v_{rf}^2 + G_3 \circ v_{rf}^3 + \ldots \tag{4.97}$$

or, alternately, as

$$i_d = i_{d1} + i_{d2} + i_{d3} + \ldots \tag{4.98}$$

From the device equation, we have

$$I_{D,M1} = \frac{k}{2}\left(v_{gs1} + (V_{GS1} - V_t)\right)^2 = \frac{k}{2}\left(v_{gs1} + (V_{GS} - V_t)\right)^2$$

(Note: Because $V_{GS1} = V_{GS2}$, we set them equal to $V_{GS}$.)

$$I_{D,M1} = \frac{k}{2}\left[v_{gs1}^2 + 2v_{gs1}(V_{GS} - V_t) + (V_{GS} - V_t)^2\right] = \frac{k}{2}\left[\left(\frac{v_{rf}}{2} - v_s\right)^2 + 2\left(\frac{v_{rf}}{2} - v_s\right)(V_{GS} - V_t) + (V_{GS} - V_t)^2\right] \tag{4.99}$$

The small signal output current can now be written as

$$i_d = I_{D,M1} - I_D = \frac{k}{2}\left[\left(\frac{v_{rf}}{2} - v_s\right)^2 + 2\left(\frac{v_{rf}}{2} - v_s\right)(V_{GS} - V_t) + (V_{GS} - V_t)^2\right] - \frac{k}{2}(V_{GS} - V_t)^2$$

$$i_d = \frac{k}{2}\left[\frac{v_{rf}^2}{4} - v_{rf}(v_{s1} + v_{s2} + v_{s3} + \ldots) + (v_{s1} + v_{s2} + v_{s3} + \ldots)^2 + 2\frac{v_{rf}}{2}(V_{GS} - V_t) - 2(v_{s1} + v_{s2} + v_{s3} + \ldots)(V_{GS} - V_t)\right] \tag{4.100}$$

4.6.2.4     Solving $G_n$

In step 3 we want to express (4.100) in the form of (4.98), where the first-,
second- and third-order terms are separated. Then we can calculate $G_n$.


Determine $G_1$ from First-Order Term $i_{d1}$

In this sub-section we first want to isolate the first-order terms in (4.100). The first
term in (4.100) is a square term in $v_{rf}$ and does not contribute to a first-order
term. The second term is a cross product of $v_{rf}$ and a first- or higher-order term of
$v_s$ ($v_{s1}$, $v_{s2}$, ...) and so does not contribute either. Likewise the third term does not
contribute. The fourth term, $2\frac{v_{rf}}{2}(V_{GS} - V_t)$, does contribute. Finally, the term
$-2v_{s1}(V_{GS} - V_t)$, in the fifth term, also contributes. Collecting these contributions
and equating them to $i_{d1}$ we have

$$i_{d1} = \frac{k}{2}\left[v_{rf}(V_{GS} - V_t) - 2v_{s1}(V_{GS} - V_t)\right] \tag{4.101}$$

Next let us apply the results from step 1, sub-section 4.6.2.2, to (4.101) and
simplify. Specifically, substituting (4.73) in (4.101), we have

$$i_{d1} = \frac{k}{2}\left[v_{rf}(V_{GS} - V_t)\right] \tag{4.102}$$

Meanwhile, equating (4.97) and (4.98), we find that $i_{d1}$ is also expressed as

$$i_{d1} = G_1 \circ v_{rf} \tag{4.103}$$

Finally, equating (4.102) and (4.103), we find $G_1$ as

$$G_1 = \frac{k}{2}(V_{GS} - V_t) = \frac{k}{2}\sqrt{\frac{I_{SS}}{k}} = \frac{1}{2}\sqrt{kI_{SS}} \tag{4.104}$$


Determine $G_2$ from Second-Order Term $i_{d2}$

As in the sub-section determining $G_1$, we first isolate the second-order terms in (4.100).
From (4.100), the first term is $\frac{v_{rf}^2}{4}$. This contributes a second-order term. For the second
term the only second-order term in it is $-v_{rf}v_{s1}$. For the third term, the second-order
term in it is $v_{s1}^2$. For the fourth term, there is no second-order term. For the fifth term,
the only second-order term in it is $-2v_{s2}(V_{GS} - V_t)$. Collecting these contributions
and equating them to $i_{d2}$ we have

$$i_{d2}(\omega_1, \omega_2) = \frac{k}{2}\left[\frac{v_{rf}^2}{4} - v_{rf}v_{s1} + v_{s1}^2 - 2v_{s2}(V_{GS} - V_t)\right] \tag{4.105}$$

Next we can simplify (4.105) based on the results from step 1 in sub-section
4.6.2.2. First, referring to (4.73): $v_{s1} = 0$. Secondly, applying (4.80) to (4.78), we
obtain $v_{s2}$.

Substituting these values of $v_{s1}$, $v_{s2}$ in (4.105), we have

$$i_{d2}(\omega_1, \omega_2) = \frac{k}{2}\left[\frac{v_{rf}^2}{4} - 2H_2(\omega_1, \omega_2)(V_{GS} - V_t) \circ v_{rf}^2\right] = \frac{k}{2}\left[\frac{1}{4} - \frac{\frac{k}{2}(V_{GS} - V_t)}{j(\omega_1 + \omega_2)C_d + 2k(V_{GS} - V_t)}\right] \circ v_{rf}^2 \tag{4.106}$$

Meanwhile, equating (4.97) and (4.98), we find that $i_{d2}$ is also expressed as

$$i_{d2} = G_2 \circ v_{rf}^2 \tag{4.107}$$

Finally, equating (4.106) and (4.107) and simplifying, we find $G_2$ as

$$G_2 = \frac{k}{8}\left[1 - \frac{1}{1 + \dfrac{j(\omega_1 + \omega_2)C_d}{2k(V_{GS} - V_t)}}\right] \tag{4.108}$$

As in step 1, sub-section 4.6.2.2, we can apply the intermediate frequency
simplification. Hence we apply (4.81) to (4.108), take the Taylor series expansion,
retain the first two terms only, and we have

$$G_2(\omega_1, \omega_2) = \frac{k}{8}\left[\frac{j(\omega_1 + \omega_2)C_d}{2k(V_{GS} - V_t)}\right] = \frac{1}{16}\,\frac{j(\omega_1 + \omega_2)C_d}{(V_{GS} - V_t)} \tag{4.109}$$


Determine $G_3$ from Third-Order Term $i_{d3}$

As in the sub-sections determining $G_1$ and $G_2$, we again first isolate the third-order
terms in (4.100). From (4.100) the first term is $\frac{v_{rf}^2}{4}$. This does not contribute to any
third-order term. For the second term the only third-order term in it is $-v_{rf}v_{s2}$.
For the third term, the only third-order term is $2v_{s1}v_{s2}$. For the fourth term, there is
no third-order term. For the fifth term, the only third-order term in it is
$-2v_{s3}(V_{GS} - V_t)$. Collecting these contributions and equating them to $i_{d3}$, we have

$$i_{d3} = \frac{k}{2}\left[-v_{rf}v_{s2} + 2v_{s1}v_{s2} - 2v_{s3}(V_{GS} - V_t)\right] \tag{4.110}$$

Next we simplify based on the results from step 1, sub-section 4.6.2.2. Let us
express all the $v_s$ terms in (4.110) according to (4.88). We have

$$i_{d3} = \frac{k}{2}\left[-v_{rf} \circ \overline{H_2(\omega_1, \omega_2)} \circ v_{rf}^2 + 2\,\overline{H_1(\omega_1)H_2(\omega_1, \omega_2)} \circ v_{rf}^3 - 2(V_{GS} - V_t)H_3 \circ v_{rf}^3\right] \tag{4.111}$$

One thing that is worth noting is that $H_2(\omega_1, \omega_2)$ in (4.88) becomes $\overline{H_2(\omega_1, \omega_2)}$ in
(4.111). This is because the $H_2(\omega_1, \omega_2)$ term assumes different values, depending on
the combination of the pair of frequencies $(\omega_1, \omega_2)$ that it is being evaluated at.

We can now refer to (4.96) and note that $H_1 = H_3 = 0$. Therefore in (4.111), the
$\overline{H_1H_2}$ term in the second term is zero, which means that the second term is zero.
Similarly, the $H_3$ term in the third term is zero and so the third term is zero as well.
Hence (4.111) simplifies to

$$i_{d3} = -\frac{k}{2}\,\overline{H_2(\omega_1, \omega_2)} \circ v_{rf}^3 = -\frac{k}{2}\left[\frac{H_2(\omega_1, \omega_2) + H_2(\omega_1, \omega_3) + H_2(\omega_2, \omega_3)}{3}\right] \circ v_{rf}^3 \tag{4.112}$$

Again equating (4.97) and (4.98), we find that $i_{d3}$ is also expressed as

$$i_{d3} = G_3 \circ v_{rf}^3 \tag{4.113}$$

Therefore, equating (4.112) and (4.113), we have

$$G_3(\omega_1, \omega_2, \omega_3) = -\frac{k}{2}\,\overline{H_2(\omega_1, \omega_2)} \tag{4.114}$$

Substituting $H_2$ from (4.96) in (4.114), we have

$$G_3(\omega_1, \omega_2, \omega_3) \equiv -\frac{k}{2}\,\overline{\left[\frac{k/4}{j(\omega_1 + \omega_2)C_d + 2k(V_{GS} - V_t)}\right]} \tag{4.115}$$


This brings us to the end of step 3.

Summary and Interpretation of Results from Step 3

We have determined $G_1$, $G_2$, $G_3$:

$$G_1 = \frac{k}{2}(V_{GS} - V_t) = \frac{k}{2}\sqrt{\frac{I_{SS}}{k}} = \frac{1}{2}\sqrt{kI_{SS}} \tag{4.116}$$

$$G_2 = \frac{k}{8}\left[1 - \frac{1}{1 + \dfrac{j(\omega_1 + \omega_2)C_d}{2k(V_{GS} - V_t)}}\right] \tag{4.117}$$

$$G_3(\omega_1, \omega_2, \omega_3) = -\frac{k}{2}\,\overline{\left[\frac{k/4}{j(\omega_1 + \omega_2)C_d + 2k(V_{GS} - V_t)}\right]} \tag{4.118}$$

Notice $G_1$ is simply the transconductance (output current/input voltage) and its
value agrees with the transconductance obtained using simple linear analysis. If $G_1$
is normalized, it also agrees with $a_1$ derived using Taylor series in (4.23). This
shows that the memory effect due to the capacitor $C_d$ does not have any impact here.
$G_2$, which corresponds to $a_2$ in (4.24), highlights the impact of the capacitor. Notice
that $a_2$ is zero (due to symmetry of the SCP) but $G_2$ is nonzero.

Finally let us examine $G_3$ at low frequency: practically $\omega_1 = \omega_2 = \omega_3 = 0$.

Since $\omega_1 = \omega_2 = \omega_3 = 0$ we can see that $H_2(\omega_1, \omega_2)$, $H_2(\omega_2, \omega_3)$, $H_2(\omega_3, \omega_1)$
are all equal. From step 1, sub-section 4.6.2.2, we have calculated $H_2(\omega_1, \omega_2)$ under
the low frequency condition in (4.83). Using that value, we have

$$H_2(\omega_1, \omega_2) = H_2(\omega_2, \omega_3) = H_2(\omega_3, \omega_1) = \frac{1}{8(V_{GS} - V_t)}$$

Substituting this in (4.114), we have

$$G_3 = \frac{-k}{16(V_{GS} - V_t)} \tag{4.119}$$
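The low-frequency values of $G_1$ and $G_3$ can be cross-checked against the exact memoryless transfer characteristic of one branch. The sketch below is illustrative and not from the original text: it assumes $C_d = 0$ and $I_{SS} = k(V_{GS} - V_t)^2$, solves the tail KCL for $v_s$, evaluates the bracket of (4.100), and extracts the first- and third-order coefficients by central finite differences. All names and numeric values are hypothetical choices.

```python
import math

K = 6250e-6    # A/V^2 (illustrative)
V_OV = 0.387   # V, stands for V_GS - V_t (illustrative)

def branch_current(v_rf):
    """Small-signal drain current of M1 from (4.100), memoryless case (C_d = 0)."""
    # Tail voltage from (4.70) with C_d = 0 and I_SS = K * V_OV**2
    v_s = V_OV - math.sqrt(V_OV**2 - v_rf**2 / 4)
    v_gs1 = v_rf / 2 - v_s
    return (K / 2) * (v_gs1**2 + 2 * v_gs1 * V_OV)

h = 0.01  # finite-difference step in volts

# First-order coefficient: central difference for the first derivative
g1_numeric = (branch_current(h) - branch_current(-h)) / (2 * h)

# Third-order coefficient: central difference for the third derivative, over 3!
third_derivative = (branch_current(2 * h) - 2 * branch_current(h)
                    + 2 * branch_current(-h) - branch_current(-2 * h)) / (2 * h**3)
g3_numeric = third_derivative / 6

g1_formula = (K / 2) * V_OV        # (4.116)
g3_formula = -K / (16 * V_OV)      # (4.119)
print(g1_numeric, g1_formula)
print(g3_numeric, g3_formula)
```

The finite-difference estimates carry a truncation error of order $h^2$, so agreement to a few parts in $10^4$ is what one would expect for this step size.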


4.6.2.5     Find  IM3,  HD3 

In step 4 we generalize the definition of HD3 from (2.29) (derived under no memory
effect) to the following definition (which is valid for cases with/without memory
effect):

$$HD_3 = \frac{\left|G_3 \circ v_{rf}^3\right|}{\left|G_1 \circ v_{rf}\right|} = \frac{\left|G_3 \circ (A_{rf}\cos\omega_1 t)^3\right|}{\left|G_1 \circ A_{rf}\cos\omega_1 t\right|} = \frac{\left|G_3(\omega_1, \omega_1, \omega_1)\right|}{\left|G_1\right|}\,\frac{A_{rf}^2}{4} \tag{4.120}$$

Next let us generalize the definition of IM3 from (2.34) (derived under no
memory effect) to the following definition (which is valid for cases with/without
memory effect):

$$IM_3 = \frac{\left|G_3(\omega_1, \omega_1, \omega_2) \circ A_{rf}^3(\cos\omega_1 t + \cos\omega_2 t)^3\right|}{\left|G_1 \circ A_{rf}\cos\omega_1 t\right|} \tag{4.121}$$


Notice that on the surface this expression is quite similar to the definition of HD3
as given in (4.120). The difference stems from the fact that $G_3$ is now calculated at
$(\omega_1, \omega_1, \omega_2)$ and, because of the frequency-dependency effect, its value bears no
simple relationship to its value calculated at $(\omega_1, \omega_1, \omega_1)$, which is what (4.120)
uses to calculate $G_3$ and hence HD3. This is the fundamental reason why with
frequency-dependent effect IM3 is no longer simply 3HD3, as stated in (2.35).

Next let us specialize the IM3 definition to the case when we are interested in
IM3 caused by adjacent channel interference. It should be noted that for IM3 we
have three frequencies $(\omega_1, \omega_1, \omega_2 = -\omega_1 - \Delta\omega)$. For IM3 caused by adjacent
channel interference, $\omega_2$ is so defined (to be at $-\omega_1 - \Delta\omega$) such that $2\omega_1 + \omega_2$
lies on the same frequency as $\omega_1 - \Delta\omega$. This guarantees that the third-order
nonlinearity that takes $\omega_1$ and $\omega_2$ as the inputs and generates the $2\omega_1 + \omega_2$ term
will end up generating a term at $\omega_1 - \Delta\omega$, which is the undesired frequency.
Meanwhile $A_{rf}$ would be replaced by $A_{interference}$. Applying these observations to
(4.121), we get

$$IM_3 = \frac{\left|G_3(\omega_1, \omega_1, \omega_2) \circ A_{interference}^3(\cos\omega_1 t + \cos\omega_2 t)^3\right|}{\left|G_1 \circ A_{interference}\cos\omega_1 t\right|} = \frac{\left|G_3(\omega_1, \omega_1, \omega_2)\right|}{\left|G_1\right|}\,\frac{3}{4}A_{interference}^2 \tag{4.122}$$

We would now calculate HD3 and IM3. For low frequency (no memory effect)
we can find HD3 by substituting (4.116) and (4.119) into (4.120). We have

$$HD_3 = \frac{A_{rf}^2}{4}\cdot\frac{k}{16(V_{GS} - V_t)}\cdot\frac{1}{\frac{1}{2}\sqrt{kI_{SS}}} \tag{4.123}$$

Now

$$(V_{GS} - V_t)^2 = \frac{I_{SS}}{k} \tag{4.124}$$

Substituting this into (4.123) and simplifying, we have

$$HD_3 = \frac{A_{rf}^2}{4}\cdot\frac{k}{16}\sqrt{\frac{k}{I_{SS}}}\cdot\frac{1}{\frac{1}{2}\sqrt{kI_{SS}}} = \frac{1}{32}\,\frac{k}{I_{SS}}\,A_{rf}^2 \tag{4.125}$$

Equation (4.125) agrees with the result obtained using Taylor series expansion
(4.28).
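As a quick arithmetic check of this simplification, the following Python sketch evaluates the low-frequency HD3 both ways: from the kernel ratio in (4.120) and from the closed form (4.125). The function names and the numeric values (borrowed from Numerical example 4.4) are assumptions made for illustration only.

```python
import math

def hd3_from_kernels(a_rf, k, v_ov):
    """HD3 via (4.120), using the low-frequency G1 (4.116) and G3 (4.119)."""
    g1 = (k / 2) * v_ov
    g3 = -k / (16 * v_ov)
    return abs(g3) / abs(g1) * a_rf**2 / 4

def hd3_closed_form(a_rf, k, i_ss):
    """HD3 via the closed form (4.125)."""
    return k * a_rf**2 / (32 * i_ss)

K = 6250e-6          # A/V^2 (illustrative)
V_OV = 0.387         # V, stands for V_GS - V_t (illustrative)
I_SS = K * V_OV**2   # bias relation (4.124)
A_RF = 0.316         # V (illustrative)

hd3_a = hd3_from_kernels(A_RF, K, V_OV)
hd3_b = hd3_closed_form(A_RF, K, I_SS)
print(hd3_a, hd3_b, 20 * math.log10(hd3_a))
```

Both routes give the same number because (4.124) ties $I_{SS}$ to $(V_{GS} - V_t)$; the check only confirms the algebra, not the underlying device model.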

Likewise, we can do the same for IM3 and it would agree with the Taylor series
expansion case. For intermediate frequency, where $j(\omega_1 + \omega_2)C_d \ll 2k(V_{GS} - V_t)$,
we could have started with the expression for $G_3$ in (4.118) and found an intermediate
frequency approximation. Alternatively, we know that in (4.114), $G_3$ is
expressed in terms of $H_2$, for which, incidentally, we have found an intermediate
frequency approximation [in (4.82)]. Hence we start with (4.120) and, substituting in
(4.114) and (4.116), we have

$$HD_3 = \frac{|G_3|}{|G_1|}\,\frac{A_{rf}^2}{4} = \frac{\left|\frac{k}{2}\overline{H_2}\right|}{\frac{1}{2}\sqrt{kI_{SS}}}\,\frac{A_{rf}^2}{4} \tag{4.126}$$

Setting $\omega_3 = \omega_2 = \omega_1$ and substituting the intermediate frequency approximation
of $H_2$ from (4.82), we have

$$HD_3 = \frac{1}{\frac{1}{2}\sqrt{kI_{SS}}}\cdot\frac{k}{2}\cdot\frac{1}{8(V_{GS} - V_t)}\left|1 - \frac{j(\omega_1 + \omega_2)C_d}{2k(V_{GS} - V_t)}\right|\frac{A_{rf}^2}{4} = \sqrt{\frac{k}{I_{SS}}}\,\frac{A_{rf}^2}{32(V_{GS} - V_t)}\left|1 - \frac{j(2\omega_1)C_d}{2k(V_{GS} - V_t)}\right| \tag{4.127}$$

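Next we can find an expression for IM3. We start from (4.122). To obtain
$G_3$ in (4.122) we go to (4.114) and we have

$$G_3(\omega_1, \omega_2, \omega_3) = -\frac{k}{2}\,\overline{H_2(\omega_1, \omega_2)}$$

To avoid confusion between the subscripts 1, 2, 3, we rewrite the expression as

$$G_3(\omega_a, \omega_b, \omega_c) = -\frac{k}{2}\,\overline{H_2(\omega_a, \omega_b)} = -\frac{k}{2}\left[\frac{H_2(\omega_a, \omega_b) + H_2(\omega_b, \omega_c) + H_2(\omega_c, \omega_a)}{3}\right] \tag{4.128}$$

From (4.122) we know we are interested in $G_3(\omega_1, \omega_1, \omega_2)$, so essentially we
should assign $a = 1$, $b = 1$, $c = 2$. Following the convention as described in (4.128),
we assign the $a$, $b$ associated with $H_2$'s argument and we have

$$G_3(\omega_1, \omega_1, \omega_2) = -\frac{k}{2}\left[\frac{H_2(\omega_1, \omega_1) + H_2(\omega_1, \omega_2) + H_2(\omega_2, \omega_1)}{3}\right] \tag{4.129}$$

At this point remember that we are dealing with the intermediate frequency
case and we would use this fact for simplification. Therefore, we substitute the
intermediate frequency approximation to $H_2$ as calculated in (4.82) into (4.129):

$$\begin{aligned}
G_3(\omega_1, \omega_1, \omega_2) &= -\frac{k}{2}\left[\frac{H_2(\omega_1, \omega_1) + H_2(\omega_1, \omega_2) + H_2(\omega_2, \omega_1)}{3}\right] \\
&= -\frac{k}{2}\,\frac{1}{8(V_{GS} - V_t)}\left[1 - \frac{jC_d}{2k(V_{GS} - V_t)}\,\frac{(\omega_1 + \omega_1) + (\omega_1 + \omega_2) + (\omega_2 + \omega_1)}{3}\right] \\
&= -\frac{k}{2}\,\frac{1}{8(V_{GS} - V_t)}\left[1 - \frac{jC_d}{2k(V_{GS} - V_t)}\,\frac{(\omega_1 + \omega_1) + (\omega_1 - \omega_1 - \Delta\omega) + (-\omega_1 - \Delta\omega + \omega_1)}{3}\right] \\
&= -\frac{k}{2}\,\frac{1}{8(V_{GS} - V_t)}\left[1 - \frac{jC_d}{2k(V_{GS} - V_t)}\,\frac{2(\omega_1 - \Delta\omega)}{3}\right] \\
&\approx -\frac{k}{2}\,\frac{1}{8(V_{GS} - V_t)}\left[1 - \frac{jC_d}{2k(V_{GS} - V_t)}\,\frac{2\omega_1}{3}\right]
\end{aligned} \tag{4.130}$$

Substituting $G_1$ from (4.116) and $G_3$ from (4.130) into (4.122), we get

$$IM_3 = \sqrt{\frac{k}{I_{SS}}}\,\frac{3A_{interference}^2}{32(V_{GS} - V_t)}\left|1 - \frac{jC_d}{2k(V_{GS} - V_t)}\,\frac{2\omega_1}{3}\right| \tag{4.131}$$

Furthermore, substituting (4.124) into (4.131), we finally get

$$IM_3 = \frac{3A_{interference}^2}{32(V_{GS} - V_t)^2}\left|1 - \frac{2}{3}\,\frac{j\omega_1 C_d}{2k(V_{GS} - V_t)}\right| \tag{4.132}$$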

It is instructive to compare (4.131) to (4.127) and note that even when we set
$A_{rf} = A_{interference}$, $IM_3 \neq 3HD_3$.

Finally we look at the case when $V_{lo}$ is switching, but with zero rise and fall time.
Even though we no longer have a time-invariant system since $V_{lo}$ is an ideal square
wave, the results can still be easily obtained. In the high-frequency case the
distortion still comes primarily from the bottom SCP when doing V-I conversion,
which does not switch. We can again incorporate the switching effect, for a $V_{lo}$ that
is an ideal square wave, by modifying the aforementioned results through
multiplying the output by the Fourier series representation of a square wave.
As in the low-frequency case, the effect of switching is simply shifting the frequency
of the fundamental and the third-order products by $f_{lo}$. The amplitudes are
both reduced by the same amount, $1/\pi$, and HD3, IM3 remain unchanged.

Numerical example 4.4.

Assume that a Gilbert mixer has a V-I converter as shown in Figure 4.16. Referring
to Figure 4.16 let $M_1$ and $M_2$ both have a $W/L$ of 50 µm/0.6 µm.

Let us apply a $V_{rf}$ at 1.9 GHz and two interference signals $V_{interference}$ at
1.9017 GHz and 1.9034 GHz. Their power levels are set at 0 dBm. Again we
make $V_{GS1} - V_t = 0.387$ V. Now assume $k' = 100$ µA/V². This means
$k = 6250$ µA/V². $C_d$ is given as $C_d = 100$ fF.

Finally let us assume that $V_{lo}$ is not switching and that the Gilbert mixer's
distortion is dominated by the V-I converter.

(a) Assuming the intermediate frequency approximation holds, find IM3 of this Gilbert
mixer.

(b) Assuming the intermediate frequency approximation does not hold, find HD2 of this
Gilbert mixer.

(a) Since the interference signal has a power level of 0 dBm, this means $A_{interference}$ =
0.316 V. Substituting this and other relevant values into (4.132), the expression
for IM3 using the intermediate frequency approximation, we have

$$IM_3 = \frac{3(0.316)^2}{32(0.387)^2}\left|1 - \frac{2}{3}\,\frac{j(2\pi \times 1.9017\,\text{G})(100\,\text{fF})}{2 \times 6250\,\mu\text{A/V}^2\,(0.387)}\right| = 0.0625\,|1 - 0.16j| = 0.06329 = -23.97\ \text{dB}$$
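The arithmetic of part (a) can be reproduced with a few lines of Python. This is a sketch under the example's stated values; the function name `im3_intermediate` is hypothetical and simply encodes (4.132).

```python
import math

A_INT = 0.316    # V, interferer amplitude for 0 dBm as quoted in the example
V_OV = 0.387     # V, V_GS - V_t
K = 6250e-6      # A/V^2
C_D = 100e-15    # F
F1 = 1.9017e9    # Hz, first interferer

def im3_intermediate(a_int, v_ov, k, c_d, w1):
    """IM3 from (4.132), intermediate-frequency approximation."""
    x = (2 / 3) * 1j * w1 * c_d / (2 * k * v_ov)
    return 3 * a_int**2 / (32 * v_ov**2) * abs(1 - x)

im3 = im3_intermediate(A_INT, V_OV, K, C_D, 2 * math.pi * F1)
im3_db = 20 * math.log10(im3)
print(f"IM3 = {im3:.4f} ({im3_db:.2f} dB)")
```

The frequency-dependent factor changes the magnitude by only about 1% here, which is why the result stays close to the memoryless prefactor of 0.0625.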


(b) First we extend the definition of HD3 given in (4.120) to HD2. After some
simplification we get

$$HD_2 = \frac{A_{rf}\,|G_2|}{2\,|G_1|}$$

Next we calculate $G_2$. Since the intermediate frequency approximation does not
hold, we use the full expression for $G_2$ as given in (4.108). Substituting the proper
values, we have

$$G_2 = \frac{k}{8}\left[1 - \frac{1}{1 + \dfrac{j(\omega_1 + \omega_2)C_d}{2k(V_{GS} - V_t)}}\right] = \frac{6250\,\mu\text{A/V}^2}{8}\left[1 - \frac{1}{1 + \dfrac{j(2 \times 2\pi \times 1.9\,\text{G})(100\,\text{fF})}{2 \times 6250\,\mu\text{A/V}^2\,(0.387)}}\right]$$

$$|G_2| = 781\,\mu\text{A/V}^2\left|1 - \frac{1}{1 + 0.493j}\right| = 781\,\mu\text{A/V}^2 \times 0.442 = 3.45 \times 10^{-4}\ \text{A/V}^2$$

evaluated with $\omega_1 = \omega_2$ at 1.9 GHz.

Next we calculate $G_1$ by substituting the proper values into (4.116). We have

$$G_1 = \frac{k}{2}(V_{GS} - V_t) = 3125\,\mu\text{A/V}^2\,(0.387\ \text{V}) = 1209\,\mu\text{A/V}$$

Finally, substituting $G_1$, $G_2$ into the definition of HD2 we have

$$HD_2 = \frac{0.316\ \text{V}}{2} \times \frac{3.45 \times 10^{-4}\ \text{A/V}^2}{1209\,\mu\text{A/V}} = 0.045 = -26.9\ \text{dB}$$
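Part (b) can be checked the same way. The sketch below encodes the full $G_2$ of (4.108), the $G_1$ of (4.116), and the HD2 definition used above; the function names are hypothetical and the values are those stated in the example.

```python
import math

A_RF = 0.316     # V, amplitude for 0 dBm as quoted in the example
V_OV = 0.387     # V, V_GS - V_t
K = 6250e-6      # A/V^2
C_D = 100e-15    # F
F_RF = 1.9e9     # Hz

def g1(k, v_ov):
    """First-order kernel (4.116)."""
    return (k / 2) * v_ov

def g2(w1, w2, k, v_ov, c_d):
    """Second-order kernel (4.108), without the intermediate-frequency approximation."""
    x = 1j * (w1 + w2) * c_d / (2 * k * v_ov)
    return (k / 8) * (1 - 1 / (1 + x))

w = 2 * math.pi * F_RF
g2_mag = abs(g2(w, w, K, V_OV, C_D))
hd2 = A_RF * g2_mag / (2 * abs(g1(K, V_OV)))
hd2_db = 20 * math.log10(hd2)
print(f"|G2| = {g2_mag:.3e} A/V^2, HD2 = {hd2:.3f} ({hd2_db:.1f} dB)")
```

Setting `C_D` to zero in this sketch makes `g2` vanish, which is the memoryless result discussed next.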

As a comparison, let us repeat the calculation for this HD2, except this time
we ignore the memory effect. First let us extend the definition of HD3 for a
general memoryless NLTI system [given in (4.26)] to the definition of HD2, again
for a general memoryless NLTI system. We have $HD_2 = \frac{1}{2}\frac{a_2}{a_1}A_{rf}$. For a Gilbert
mixer, from (4.24), $a_2 = 0$. Hence the HD2 of a Gilbert mixer, ignoring memory
effect, is 0. This is different from the HD2 calculated incorporating memory effect.
This difference may not be surprising because when we include memory effect the
Gilbert mixer no longer has odd symmetry in the ac sense and therefore $HD_2 \neq 0$.


4.7     Noise 

In this section we are interested in taking the NF specified for a mixer in chapter 2 and
finding circuit parameters that will satisfy this specification. As opposed to sections
4.5 and 4.6, in the present section the case when $V_{lo}$ is switching is very important
and we will devote considerable attention to it. For simplicity the example we
choose is the unbalanced mixer. The methodology and result can be readily
extended to the Gilbert mixer. Referring to the unbalanced mixer in Figure 4.3,
let us assume that most of the noise comes from the input voltage to current
converter $M_3$. In essence we have assumed the switching transistors $M_{1-2}$ do not
contribute any noise and that the load's noise contribution (not shown), when input
referred, is negligible.


4.7.1     $V_{lo}$ Not Switching Case

We assume, as a start, and to ease mathematical complication, that $V_{lo}$ is not
switching and therefore the mixer is an LTI system. Turning to the unbalanced
mixer in Figure 4.3, all noise contribution from $M_3$ is propagated to the IF output
with switching transistors $M_{1-2}$ assumed to be on all the time. Under this assumption
and with $M_{1-2}$ assumed noiseless, the noise present at the drain of $M_{1-2}$ is the same
as the noise present at the source. Now the power spectral density (PSD) of the
current noise present at the source of $M_{1-2}$ is simply given by the PSD of $M_3$'s input
referred voltage noise multiplied by $g_{m3}^2$. Assuming that there is only thermal
noise present in $M_3$, the PSD of $M_3$'s input referred voltage noise, denoted as $S_{n1}$, is
given by

$$S_{n1} = 4kT \times \frac{2}{3}\,\frac{1}{g_{m3}} \tag{4.133}$$

Fig. 4.17    Block diagram representation of an unbalanced mixer for noise calculation (blocks: V-I converter, e.g. $M_3$; switching stage, e.g. $M_1$-$M_2$; IF filter)


4.7.2     $V_{lo}$ Switching Case


In this section we want to investigate the noise behaviour of a mixer when $V_{lo}$ is
switching periodically. For the unbalanced mixer in Figure 4.3 this means we have
to include the effects due to switching of $M_{1-2}$.

To highlight the periodic time-varying nature of this mixer, let us represent the
unbalanced mixer of Figure 4.3 in a block diagram form, as shown in Figure 4.17.
Note that Figure 4.17 breaks the mixer into two stages: a linear time invariant (LTI)
first stage (corresponding to $M_3$ of Figure 4.3), followed by a linear periodic time
varying (LPTV) second stage (the switching part, corresponding to $M_1$-$M_2$ of
Figure 4.3). At the input to the first stage, all the noise sources from that particular
stage are lumped together and input referred. This is denoted as $\overline{V_{n1}^2}$, the equivalent
input voltage noise source to stage 1. This noise gets amplified by stage 1 and
generates equivalent input current noise sources $\overline{I_{n2}^2}$ and $\overline{I_{n3}^2}$ at nodes 2 and 3,
respectively. The PSDs of these noise sources are labeled $S_{n1}$, $S_{n2}$, $S_{n3}$. $S_{n1}$ and
$S_{n2}$ are assumed to have a flat frequency response up to the RF frequencies and
beyond (i.e., we have included only the thermal noise from transistor $M_3$ and have
neglected the 1/f noise). Our goal is to find $\overline{I_{n3}^2}$ and $S_{n3}$. If the second stage is
assumed to be an LTI system as in sub-section 4.7.1, $S_{n3}$'s frequency response is flat.
However, if we include the periodic time-varying effect, this can be more complicated.
Finally, the mixer output passes through an IF filter (not shown in Figure 4.3)
to generate the IF signal. If we assume that this mixer is used in the architecture
described in Figure 2.2 then the IF filter is the same as BPF3 described in that figure.
The bandpass filter is centered at $\omega_{if}$, with bandwidth BW. Here we assume its
frequency response to be constant with a value of one from $\omega_{if} - BW/2$ to $\omega_{if} + BW/2$,
and zero otherwise. The mixer stages 1 and 2 are assumed to have no
frequency dependent effect (in other words, they have infinite bandwidth).

At this point let us highlight one important feature that arises as a result of
treating the switching stage as an LPTV system. We highlight this feature by first
asking what would have happened if we had done the opposite: treating the
switching part as an LTI system. Under that scenario, $\overline{I_{n2}^2}$, the equivalent noise at
node 2, will have some of its frequency components filtered by the IF filter. Specifically,
those frequency components of $\overline{I_{n2}^2}$ that lie outside the bandwidth of the
IF filter (i.e., frequency components of $\overline{I_{n2}^2}$ whose $\omega$ is smaller than $\omega_{if} - BW/2$ or
whose $\omega$ is larger than $\omega_{if} + BW/2$) will be filtered. Next let us return to the scenario
when we do treat stage 2 as an LPTV system. We assume that stage 2 is varying
periodically at the LO frequency, $\omega_{lo}$. In this case, let us again assume the PSD of
$\overline{I_{n2}^2}$ also has frequency components outside the bandwidth of the IF filter. It turns out
that not all of these frequency components will be filtered in the present case. Some
of these frequency components will appear at the IF output. For example, frequency
components of $\overline{I_{n2}^2}$ whose $\omega$ is centered around $\omega_{rf}$ (such that $\omega_{rf} = \omega_{lo} + \omega_{if}$) will
appear at the IF output, even though $\omega_{rf}$ itself lies outside of the IF filter bandwidth.
This is because upon switching this frequency component is translated (or aliased,
if we treat switching, or mixing, as a special case of sampling) down to $\omega_{if}$ and thus
lies inside the IF filter passband.
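The frequency translation described here can be illustrated numerically with a single tone standing in for one spectral component of the noise. The Python sketch below is an illustration under stated assumptions, not the method developed in this section: the switching stage is modelled as an ideal 0/1 switching function at the LO rate (an assumed model of the unbalanced mixer with zero rise and fall time), the frequencies are arbitrary integers in cycles per record, and all names are hypothetical.

```python
import math

N = 4000              # samples in one record of unit length
F_LO, F_IF = 10, 1    # illustrative frequencies, in cycles per record
F_RF = F_LO + F_IF    # tone placed outside the IF passband

def switch(i):
    """Ideal 0/1 switching function at the LO rate (assumed model)."""
    return 1.0 if math.cos(2 * math.pi * F_LO * i / N) > 0 else 0.0

def tone_amplitude(samples, freq):
    """Amplitude of the component at `freq` cycles per record (single-bin DFT)."""
    re = sum(s * math.cos(2 * math.pi * freq * i / N) for i, s in enumerate(samples))
    im = sum(s * math.sin(2 * math.pi * freq * i / N) for i, s in enumerate(samples))
    return 2 * math.hypot(re, im) / N

def if_amplitude(f_in, switching):
    """Amplitude at F_IF produced by a unit tone at f_in, with or without switching."""
    y = [math.cos(2 * math.pi * f_in * i / N) * (switch(i) if switching else 1.0)
         for i in range(N)]
    return tone_amplitude(y, F_IF)

lti = if_amplitude(F_RF, switching=False)   # time-invariant stage: nothing at IF
lptv = if_amplitude(F_RF, switching=True)   # switched stage: tone folds down to IF
print(lti, lptv, 1 / math.pi)
```

With the stage held on, the tone at `F_RF` leaves essentially nothing at `F_IF`; with switching, a component close to $1/\pi$ of the input amplitude appears there. A tone placed at `3 * F_LO + F_IF` folds down as well, with a smaller weight set by the third harmonic of the switching function.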

In general, noise components centered around multiples of $\omega_{rf}$, which are
normally outside the IF filter passband, can be frequency translated into the passband
due to this periodic time varying property and can then pass through the IF filter.
This will result in the final output noise being greater (potentially a lot greater) than
that predicted by simply assuming the mixer is an LTI circuit. As a counter argument,
one may argue that this is not something to be worried about, as we have so far neglected
the frequency response of the mixer stage 2. We may choose to argue that this stage
has a frequency response with a finite bandwidth (up to now we assume that it has an
infinite bandwidth) and hence would have filtered out the frequency components of
these noise sources at multiples of $\omega_{rf}$ at any rate, before these frequency components
even get a chance to get to the IF filter. This turns out to be overly optimistic because,
by design, this mixer should be a wideband circuit, so its bandwidth
is usually rather wide and such frequency filtering does not usually occur. We may
further appreciate why this mixer should be a wideband circuit by recognizing the fact
that the primary function of a mixer (at least for mixers used for down conversion) is to
frequency translate a high-frequency RF signal to a low-frequency IF signal.
This function necessitates the mixer's bandwidth to be wide enough to admit the
high-frequency incoming RF signal in the first place.


4.7.2.1     Theory of Linear Periodic Time Varying (LPTV) System

We will now develop a general theory that allows us to calculate noise for an LPTV system. Just as the Volterra series can be used to analyze high-frequency distortion effects for any NLTI system with memory, this theory can be used to analyze noise in any LPTV system. The study of this theory also enjoys the same benefits as the study of the Volterra series, which were highlighted at the beginning of section 4.6.1. For example, when applied to circuits it shows that the periodic time varying effect can easily degrade the noise performance by close to 100% relative to the prediction of a time invariant analysis (see section 4.7.3.3). In addition, LPTV noise analysis of mixers has been adopted in many simulators (e.g. SpectreRF [7]). Other circuits in this book to which this theory can be applied include the LC oscillator covered in chapter 7. For example, question 7 of chapter 7 covers the basic phase noise analysis of such an LC oscillator, assuming the oscillator is an LTI system; the theory can be used to extend the results of that question to the case when the oscillator is treated as an LPTV system. As a matter of fact, this more sophisticated phase noise analysis has also been adopted in SpectreRF [7]. The above discussion should give readers the proper motivation to go through this theory, which involves a bit of mathematics.

Fig. 4.18    Block diagram of the switching part of an unbalanced mixer

We introduce the concept of an LPTV system by considering again the mixer stage 2 part of Figure 4.17, redrawn in Figure 4.18. In Figure 4.18, $S_{n2}$ is the power spectral density (PSD) of the noise $i_{n2}$ at node 2, and $S_{n3}$ is the PSD of the noise at node 3. As discussed before, due to the periodic time varying nature of stage 2, $S_{n3}$ will contain aliased noise. $H_n$ denotes the transfer function of the noise from node 2 to node 3, except that, since this is an LPTV system, the concept of the transfer function (as well as the input/output relationship) has to be redefined. (We use $H_n$ rather than $H$ because $H$ has customarily been used to denote the transfer function of an LTI circuit. Since this notation is restricted to the discussion on noise, there should not be any confusion with the use of $H_n$ as the Volterra kernel in section 4.6, which focuses exclusively on distortion.) In general, for such an LPTV system we have a modified definition of the input-output relationship, given as [8]


$$S_{n3}(\omega_{if}) = \sum_{n=-\infty}^{\infty} \left|H_n(\omega_{if})\right|^2 S_{n2}(\omega_{if} + n\omega_{lo}) \tag{4.134}$$
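As a minimal sketch of how (4.134) is evaluated, the function below sums the input PSD at each folded frequency, weighted by $|H_n|^2$. The function name, the dictionary of $H_n$ values, and the numbers used are hypothetical and chosen only for illustration; frequencies are in Hz rather than rad/s.

```python
from typing import Callable, Dict


def folded_output_psd(h_n: Dict[int, complex],
                      s_n2: Callable[[float], float],
                      f_if: float, f_lo: float) -> float:
    """Evaluate (4.134): S_n3(f_if) = sum_n |H_n|^2 * S_n2(f_if + n * f_lo)."""
    return sum(abs(h) ** 2 * s_n2(f_if + n * f_lo) for n, h in h_n.items())


# Hypothetical example: three sidebands and a white input PSD of 1 unit/Hz.
h_example = {-1: 0.3, 0: 0.5, 1: 0.3}


def white(f: float) -> float:
    return 1.0


s_n3 = folded_output_psd(h_example, white, f_if=100e6, f_lo=1.8e9)
# What an LTI analysis would predict: only the n = 0 term.
s_lti = abs(h_example[0]) ** 2 * white(100e6)
```

With these assumed values the folded result exceeds the n = 0 term alone, which is the extra noise that an LTI analysis would miss.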


$H_n$ has a modified definition compared with the usual system transfer function $H$ of a conventional LTI system and is defined as [8]


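$$H_n(\omega_{if}) = \int_{-\infty}^{\infty}\left[\frac{1}{T}\int_{0}^{T} h(v+u,u)\,e^{jn\omega_{lo}u}\,du\right] e^{-j\omega_{if}v}\,dv = \frac{1}{T}\int_{0}^{T}\left[\int_{-\infty}^{\infty} h(v+u,u)\,e^{-j\omega_{if}v}\,dv\right] e^{jn\omega_{lo}u}\,du \tag{4.135}$$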
Fig.  4.19    Behavior  of  impulse  responses  of  a  LTI  system


where $\int_{-\infty}^{\infty} h(v+u,u)\,e^{-j\omega_{if}v}\,dv$ can be interpreted as the time varying (periodic) transfer function from node 2 to node 3. Here $h(v+u,u)$ is the impulse response of the LPTV system, and $T$ is the LO period. From the definition of the Fourier series, $\frac{1}{T}\int_{0}^{T}\left[\int_{-\infty}^{\infty} h(v+u,u)\,e^{-j\omega_{if}v}\,dv\right]e^{jn\omega_{lo}u}\,du$ is the nth-order coefficient of the Fourier series expansion of this impulse response. Since (4.135) gives $H_n$ in exactly this form, $H_n$ can be interpreted as the nth-order coefficient of the Fourier series expansion of the impulse response.

Having introduced $h(v+u,u)$, the impulse response function of an LPTV system, let us briefly review some of its properties. To highlight them, we first compare it to the impulse response of a conventional LTI system. Let us define, in Figure 4.19, the launch time $u$ as the time when we launch an impulse. The observe time $v$ is the time we wait from the moment we launch the impulse to the moment we observe the response of the system.

For the LTI system in Figure 4.19, the distinguishing feature is that no matter when you launch your impulse (in this case $u \neq u'$), as long as you wait the same amount of time before you observe (the same $v$), you see the same response; that is, the values of $h$ (the impulse response function) are identical.

The same cannot be said of a linear time varying (LTV) system. Here, if we start with different launch times, that is, $u \neq u'$, then $h$ will be different even though $v$ is the same. How about an LPTV system, the special case of an LTV system that applies to a mixer, our present focus? As with an LTV system, in general $h$ is not the same. However, under one special circumstance $h$ will be the same: when the launch times differ by exactly $T$, the period with which the system repeats itself, as shown in Figure 4.20.
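The three behaviors can be illustrated with a small sketch. The impulse responses below are hypothetical (an exponential decay, with a peak that is scaled by a T-periodic function of launch time in the LPTV case); each function takes the observe time v and the launch time u, so it plays the role of h(v + u, u).

```python
import math

T = 1.0      # assumed LO period (arbitrary units)
TAU = 0.05   # assumed decay time constant, short compared with T


def h_lti(v: float, u: float) -> float:
    """LTI: the response depends only on the observe time v, not on launch time u."""
    return math.exp(-v / TAU) if v >= 0 else 0.0


def h_lptv(v: float, u: float) -> float:
    """LPTV: same shape, but the peak is scaled by a T-periodic function of u."""
    peak = 1.0 + math.cos(2 * math.pi * u / T)
    return peak * math.exp(-v / TAU) if v >= 0 else 0.0


v = 0.02
u1, u2 = 0.1, 0.35

same_for_lti = math.isclose(h_lti(v, u1), h_lti(v, u2))
differs_for_lptv = not math.isclose(h_lptv(v, u1), h_lptv(v, u2))
repeats_after_T = math.isclose(h_lptv(v, u1), h_lptv(v, u1 + T))
```

For the same observe time, the LTI response is independent of launch time, the LPTV response generally is not, and the LPTV response repeats when the launch time is shifted by exactly T.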

Fig. 4.20    Behavior of impulse responses of an LPTV system

Now that we have highlighted some qualitative differences between LTI, LTV, and LPTV systems, we will look at a more rigorous definition of the LPTV system. The impulse response function $h(v+u,u)$ of an LPTV system is periodic in $u$. With $t = v + u$, we can also write $h(v+u,u)$ as $h(t,u)$. From now on we will use either expression for the impulse response, depending on which is more convenient in the particular situation. What is the property of the Fourier transform of $h$? Since $h$ is a function of two variables, its Fourier transform is also a function of two variables; that is, it is a two-dimensional Fourier transform:


$$H(\omega_{if},\omega_s) = \frac{1}{2\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} h(t,u)\,e^{j\omega_s u}\,e^{-j\omega_{if}t}\,du\,dt \tag{4.136}$$


Here $\omega_{if}$ is the IF and $\omega_s$ is an arbitrary frequency. What is the property of this two-dimensional Fourier transform? Since $h$ is periodic in one variable, $u$, after taking its Fourier transform with respect to $u$, the resulting function is still periodic in $t$. Hence taking the Fourier transform of the resulting periodic function with respect to $t$ reverts to expanding that function in a Fourier series and finding its Fourier series coefficients. Since the Fourier series coefficients are discrete (as opposed to continuous, as in the Fourier transform case), the final function is labeled $H_n$, where $n$ is an integer.

In what follows we will start from (4.136) and actually derive $H_n$. Our goal is to show that $H_n$ can be interpreted as being similar to $H$, the system transfer function of an LTI system. Before that, we need to clarify $h(t,u)$ further and adjust its time axis.

First, what does it mean for $h(t,u)$ to be a function of two (rather than one) variables? Recall that $h(t,u)$ is the system output that we observe at time $t$, where $t$ spans from $t = 0$ to $t = \infty$, for a given launch time $u$. This is plotted in Figure 4.21(a). That is, when the impulse is launched at $u_1$, $h$ varies as a function of time $t$ in one particular fashion (in our example, a fast rise followed by a slow fall). When the impulse is launched at $u_2$, $h$ also varies as a function of time, but in a different fashion, as shown in Figure 4.21(c) (a medium rise followed by a medium fall). This is what we mean when we say that $h(t,u)$ is a function of two variables.

Fig. 4.21    (a), (c) Behavior of h as a function of launch time; (b), (d) corresponding g

Secondly, we want to readjust the time axis of $h(t,u)$ because the present time axis is not very convenient. Why? Because $u$ may be larger than 0, in which case there is a time span, $t$ from 0 to $u$, during which we are observing the output before the impulse is even launched. Any output observed in this time span is meaningless. [In our example in Figure 4.21(a), (c) we simply assume $h = 0$ at those instants, that is, $h(u_1,u_1) = 0$ and $h(u_2,u_2) = 0$.] A more convenient way is to shift the time scale to $v$ so that the output always starts to be monitored at time zero, which is also the time when we launch the impulse. Therefore, we must have $v = t - u$. Since the launch time $u$ is itself a variable, this shift in the time axis is not constant but is a function of the launch time. Using this new time axis means that we will now be observing at $v > 0$, which is also the time when the output is meaningful.

Now, on this new time scale we want to define the impulse response function, denoted as $g$, which must equal the old impulse response function $h$ on the old time scale. To find their relationship, a convenient point is where both must be zero. We know $h(u,u)$ must be zero (since right after we launch, the output is zero) and $g(0,u)$ must be zero (since by our choice we want this new function always to start out at zero). Hence $g(0,u) = h(u,u)$. Since the two functions are related only by a shift of the time axis and nothing else (e.g. the time scale does not expand or contract), they will be identical thereafter, each on its own time axis, and so $g(0+v,u) = h(v+u,u)$, or $g(v,u) = h(v+u,u)$. Pictorially, this means that we take the original plot of $h(t,u)$ for each different $u$, slide it along the axis by a different amount corresponding to that $u$, and then put each of these plots individually on the $v$-axis. They all start out from $v = 0$, and we have created the equivalent $g$ plot. For example, the $h$ plots in Figure 4.21(a) and (c) are re-created as $g$ plots in Figure 4.21(b) and (d), respectively, following this procedure.
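The change of time axis can be sketched in a few lines. The particular h(t, u) below is a hypothetical example (zero before launch, then a rise and decay whose time constant depends on the launch time); only the relation g(v, u) = h(v + u, u) comes from the text.

```python
import math


def h(t: float, u: float) -> float:
    """Hypothetical h(t, u): zero until the launch time u, then rise and decay."""
    if t < u:
        return 0.0                     # nothing meaningful before the launch
    tau = 0.1 * (1.0 + u)              # assumed: the shape depends on launch time
    return (t - u) * math.exp(-(t - u) / tau)


def g(v: float, u: float) -> float:
    """Same response on the shifted axis v = t - u, so g(v, u) = h(v + u, u)."""
    return h(v + u, u)
```

Every g(., u) starts at v = 0 with value zero regardless of u, which is the property the shifted axis was introduced to provide.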

Having clarified $h(t,u)$ and adjusted its time axis, let us start deriving $H_n$. As stated before, we start from (4.136) and substitute $v = t - u$ and $h(t,u) = g(v,u)$, so that $H$ becomes


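$$H(\omega_{if},\omega_s) = \frac{1}{2\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} g(v,u)\,e^{j\omega_s u}\,e^{-j\omega_{if}(v+u)}\,du\,dv \tag{4.137}$$

or, collecting the exponents in $u$,

$$H(\omega_{if},\omega_s) = \frac{1}{2\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} g(v,u)\,e^{j(\omega_s-\omega_{if})u}\,e^{-j\omega_{if}v}\,du\,dv \tag{4.138}$$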


Since $g(v,u)$ is periodic in $u$ with period $T$ (the period of the LO), it can be expressed as a Fourier series with angular frequency $\omega_{lo} = 2\pi/T$:


$$g(v,u) = \sum_{n=-\infty}^{\infty} g_n(v)\,e^{-jn\omega_{lo}u} \tag{4.139}$$

where

$$g_n(v) = \frac{1}{T}\int_{0}^{T} g(v,u)\,e^{jn\omega_{lo}u}\,du \tag{4.140}$$
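The Fourier pair (4.139) and (4.140) can be checked numerically. The sketch below assumes a hypothetical g(v, u) containing only a DC term and a first harmonic in u, computes g_n(v) with a simple Riemann sum over one period, and rebuilds g from the coefficients; all names and values are illustrative.

```python
import cmath
import math

T = 1.0
W_LO = 2 * math.pi / T


def g(v: float, u: float) -> float:
    """Hypothetical impulse response, T-periodic in u (illustration only)."""
    return math.exp(-v) * (1.0 + math.cos(W_LO * u))


def g_n(n: int, v: float, samples: int = 64) -> complex:
    """(4.140): g_n(v) = (1/T) * integral_0^T g(v, u) exp(+j n w_lo u) du."""
    du = T / samples
    total = sum(g(v, k * du) * cmath.exp(1j * n * W_LO * k * du)
                for k in range(samples))
    return total * du / T


def g_rebuilt(v: float, u: float, n_max: int = 3) -> complex:
    """(4.139): g(v, u) = sum_n g_n(v) exp(-j n w_lo u), truncated to |n| <= n_max."""
    return sum(g_n(n, v) * cmath.exp(-1j * n * W_LO * u)
               for n in range(-n_max, n_max + 1))
```

For this example the only nonzero coefficients are g_0(v) = exp(-v) and g_1(v) = g_-1(v) = exp(-v)/2, and the truncated sum reproduces g(v, u).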

Next let us substitute (4.139) into (4.138), and upon simplification we have

$$H(\omega_{if},\omega_s) = \int_{-\infty}^{\infty}\sum_{n=-\infty}^{\infty} g_n(v)\,\delta(\omega_s-\omega_{if}-n\omega_{lo})\,e^{-j\omega_{if}v}\,dv \tag{4.141}$$


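Exchanging the order of integration and summation, we can rewrite (4.141) in the following form:

$$H(\omega_{if},\omega_s) = \sum_{n=-\infty}^{\infty} H_n(\omega_{if})\,\delta(\omega_s-\omega_{if}-n\omega_{lo}) \tag{4.142}$$

where we have defined a new function $H_n$ as

$$H_n(\omega_{if}) = \int_{-\infty}^{\infty} g_n(v)\,e^{-j\omega_{if}v}\,dv \tag{4.143}$$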

Note that this function is a function of $n$ and $\omega_{if}$. Furthermore, it is related to the Fourier coefficient $g_n$ of the impulse response $g(v,u)$ of the system. Therefore, we call it a system function, and this is the $H_n$ we have been seeking.

As stated above, we want to show that this $H_n$ is similar to $H$, the system transfer function of an LTI system. To do this we have to show that $H_n$ relates the input/output spectra of an LPTV system in a fashion similar to the way $H$ relates the input/output spectra of an LTI system. That means we have to investigate the relationship between the output and input spectra of an LPTV system. We start our investigation in the time domain. As in an LTI system, the output of an LPTV system in the time domain is related to the input, via the impulse response of the system, by the convolution integral:


$$y(t) = \int_{-\infty}^{\infty} h(t,u)\,x(u)\,du \tag{4.144}$$


Now  going  back  to  the  frequency  domain  the  output  spectrum  is  related  to  the 
input  spectrum  by  taking  the  transform  of  (4.144): 


$$Y(\omega) = \int_{-\infty}^{\infty} H(\omega,\omega_s)\,X(\omega_s)\,d\omega_s \tag{4.145}$$


Here $\omega$ and $\omega_s$ are two arbitrary frequencies that we use as the basis to observe the output frequency spectrum (analogous to the two arbitrary times $t$ and $u$ in the time domain). Normally, though, if we are not interested in the whole spectrum but just in the response at a particular frequency, we can set the variable $\omega$ to just that frequency. In the present case the frequency response at $\omega_{if}$ is of particular interest, so we set $\omega$ equal to $\omega_{if}$ and (4.145) becomes


$$Y(\omega_{if}) = \int_{-\infty}^{\infty} H(\omega_{if},\omega_s)\,X(\omega_s)\,d\omega_s \tag{4.146}$$

Meanwhile $H(\omega_{if},\omega_s)$ is related to $H_n$ via (4.142). Therefore, we substitute (4.142) into (4.146), and after some algebraic manipulation we have

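$$Y(\omega_{if}) = \sum_{n=-\infty}^{\infty} H_n(\omega_{if})\,X(\omega_{if}+n\omega_{lo})$$

and

$$\left|Y(\omega_{if})\right|^2 = \sum_{n=-\infty}^{\infty} \left|H_n(\omega_{if})\right|^2 \left|X(\omega_{if}+n\omega_{lo})\right|^2 \tag{4.147}$$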

This is essentially the same form as (4.134) if we map

$$\left|Y(\omega_{if})\right|^2 = S_{n3}(\omega_{if}) \quad\text{and}\quad \left|X(\omega_{if}+n\omega_{lo})\right|^2 = S_{n2}(\omega_{if}+n\omega_{lo}) \tag{4.148}$$

Examining (4.147), we have finally shown that $H_n(\omega_{if})$ so defined relates the input/output spectra of an LPTV system in a fashion similar to the way $H$ relates the input/output spectra of an LTI system. This justifies calling $H_n$ a system function, in the sense that it has a property similar to that of the system transfer function $H$.
To close off the discussion, if in (4.140) we replace $g(v,u)$ by $h(v+u,u)$ and then substitute the resulting equation into (4.143), we have $H_n$ as:

$$H_n(\omega_{if}) = \int_{-\infty}^{\infty}\left[\frac{1}{T}\int_{0}^{T} h(v+u,u)\,e^{jn\omega_{lo}u}\,du\right] e^{-j\omega_{if}v}\,dv = \frac{1}{T}\int_{0}^{T}\left[\int_{-\infty}^{\infty} h(v+u,u)\,e^{-j\omega_{if}v}\,dv\right] e^{jn\omega_{lo}u}\,du \tag{4.149}$$


This  agrees  with  the  original  form  we  have  stipulated  for  Hn  in  (4.135). 

We  have  now  finished  our  discussion  of  a  LPTV  system.  Next  we  want  to  apply 
the  results  derived  to  the  specific  LPTV  system  we  are  interested  in,  the  mixer  stage 
2  as  described  in  Figure  4.17  and  Figure  4.18.  In  particular,  we  want  to  find  the 
Hn  of  this  specific  LPTV  system. 


4.7.2.2     Special $V_{lo}$ Switching Case

It is normally hard to get a closed-form solution for the $H_n$ of an LPTV system. Simulators such as SpectreRF [7] are typically used instead. Occasionally, if the LPTV system is simple enough, a closed-form solution is available. For example, we will attempt to find the closed-form solution of $H_n$ for a simple mixer in problem 4.12(b). Alternatively, when an LPTV system (such as mixer stage 2) operates under a special circumstance, its $H_n$ can be determined rather easily.

To find out what the special circumstance is, we first derive an LTI system by applying a constant signal (whose value equals the average value of $V_{lo}$) to mixer stage 2, an LPTV system. This LTI system contains poles and zeroes, the lowest frequency of which is denoted as $f_{min\_pole\_zero}$.

If the condition

$$f_{min\_pole\_zero} \gg f_{lo} \tag{4.150}$$

is satisfied, then we are operating under the special $V_{lo}$ switching case. The LPTV system becomes a special LPTV system, and $H_n$ for this special LPTV system becomes rather easy to find.

For example, a mixer with an impulse response $h(t,u)$, described in Figure 4.22, satisfies this condition. For this mixer, once we fix the launch phase, the poles and zeroes of the resulting LTI system that corresponds to this launch phase are all at much higher frequencies than $f_{lo}$. (Notice that once the launch phase is fixed, the mixer is an LTI system and the concept of poles and zeroes applies.) How do we know that? We find that this is true by examining the $h(t,u)$ corresponding to different launch times. Six of them are plotted in Figure 4.22, corresponding to launch times $u_1$ to $u_6$. From Figure 4.22, notice that in all six cases $h(t,u)$ rises and falls rapidly compared with $T$ (the LO period). Since the rise and fall time is a function of the poles and zeroes, this observation indirectly tells us that the poles and zeroes are at frequencies much higher than $f_{lo}$. It is also precisely this property that makes the general shape of $h(t,u)$ rather independent of the launch time $u$, with only the peaks decaying. Specifically, in this example $h(t,u_1)$, $h(t,u_2)$, $h(t,u_3)$, $h(t,u_4)$, $h(t,u_5)$, and $h(t,u_6)$ all have similar shapes, except that their peaks decrease as $u$ goes from $u_1$ to $u_6$. In the circuit sense, satisfying (4.150) means that, since the LO signal changes slowly, the system can be treated as a time invariant system.

Fig. 4.22    Impulse response of a mixer operating under the special $V_{lo}$ switching case

Next, if mixer stage 2 of an unbalanced mixer does satisfy (4.150) and can be represented by a special LPTV system, how do we represent this special LPTV system? For illustration purposes we assume that the LO signal is a sine wave, $V_{lo} = A_{lo}\cos(\omega_{lo}t)$, and hence the internal operating points change in a sinusoidal fashion, but in such a way that the locations of the poles/zeroes remain practically the same. Hence the shape of $h(t,u)$ remains the same as $u$ changes. Of course, since the internal operating points change, the peaks of $h(t,u)$ change accordingly. Following [3], mixer stage 2, which is just an SCP, has switching characteristics described by the S-function in Figure 4.2. It is assumed that $A_{lo}$ is much larger than $V_+$ and $V_-$. Hence when the impulse is launched at a time when the LO signal is at its minimum, the peak of the impulse response is at its minimum (0). Conversely, when the impulse is launched at a time when the LO signal is at its maximum, the peak of the impulse response is at its maximum (peak_max).

If we plot the impulse response of mixer stage 2, denoted as $h_{mixer\_stage2}(t,u)$, it exhibits a trend similar to the $h(t,u)$ of the mixer described in Figure 4.22. For example, let us assume that an impulse is launched at the same time as $u_6$ in Figure 4.22. We assume that the mixer described in Figure 4.22 and mixer stage 2 use the same $V_{lo}$ and have the same LO period. Hence, as in Figure 4.22, this launch time is close to 1/2 of $T_{lo}$. At that time the LO signal is close to $A_{lo}\cos(180^\circ)$, or $-A_{lo}$, which is its minimum value. Hence a minimum LO signal is applied to mixer stage 2 in Figure 4.17. Since mixer stage 2 corresponds to the SCP $M_{1-2}$ of the unbalanced mixer in Figure 4.3, a minimum LO signal means that $V_{lo}^{+}$ is at its most negative value and $V_{lo}^{-}$ is at its most positive value. Consequently transistor $M_1$ in Figure 4.3 is off. Correspondingly, the impulse injected at the input of mixer stage 2 in Figure 4.17 (which corresponds to the source of $M_1$) does not get propagated through, and the impulse response should have a small peak (close to 0); that is, $h_{mixer\_stage2}(t,u_6)$ has a small peak. This is similar to $h(t,u_6)$ as described in Figure 4.22.

Fig. 4.23    An equivalent representation of mixer stage 2 in the special $V_{lo}$ switching case

To further demonstrate the similarity between $h_{mixer\_stage2}(t,u)$ and $h(t,u)$, we now launch an impulse to mixer stage 2 at the same time as $u_1$ in Figure 4.22. As in Figure 4.22, this launch time is close to the beginning of $T$, so the corresponding LO signal is close to $A_{lo}\cos(0^\circ)$, or $A_{lo}$. The LO signal is at its maximum value and transistor $M_1$ is fully on. The impulse gets propagated through and the impulse response should have a maximum peak (close to peak_max); that is, $h_{mixer\_stage2}(t,u_1)$ has a maximum peak. This is similar to $h(t,u_1)$ as described in Figure 4.22. For impulses launched at times $u_2$ to $u_5$, the SCP $M_{1-2}$ in Figure 4.3 is operating in the linear region of Figure 4.2 (between $V_+$ and $V_-$) and $M_1$ is partially on. The degree to which $M_1$ is on changes with the launch time. Hence the peak response decreases accordingly from $u_2$ to $u_5$, which exhibits a trend similar to the $h(t,u)$ of the mixer shown in Figure 4.22, where the decreasing peaks are connected by a dotted line.

To summarize, the peak of the impulse response $h_{mixer\_stage2}(t,u)$ for mixer stage 2, corresponding to an impulse launched at time $u$, is proportional to $S[V_{lo}(u)]$, or $S[A_{lo}\cos\omega_{lo}u]$, in Figure 4.2. To model this behavior, mixer stage 2 is now represented by the equivalent system shown in Figure 4.23. This equivalent system consists of a cascade of two blocks. The first block is an LTI system whose impulse response, $h_{LTI}$, is obtained from mixer stage 2 by fixing the launch time at $u = u_q$, or $h_{LTI}(v) = h_{mixer\_stage2}(v,u_q)$. Here $u_q$ stands for the launch time when mixer stage 2 is in its quiescent state. We further assume that mixer stage 2 is in its quiescent state when $V_{lo}$ is midway between its minimum and maximum values. (Physically, to obtain $h_{LTI}$ we take mixer stage 2, apply a constant value equal to the average of $V_{lo}$ at the LO port, and apply an impulse at the input; the response obtained equals $h_{LTI}$.) Returning to Figure 4.23, the output from this LTI system is fed to the second block, a gain block whose gain, gain(u), is a function of launch time $u$. This gain block models the dependency of the peak of the impulse response on $V_{lo}$, and hence gain(u) is also proportional to $S[V_{lo}(u)]$. With this equivalence, we can set $h_{mixer\_stage2}(v+u,u)$ to $h_{LTI}(v)\,\mathrm{gain}(u)$.

Fig. 4.24    Mixer stage 2 is represented as a special LPTV system. The gain part of this special LPTV system is plotted as a function of launch time


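Using this equivalence model, we can calculate $H_n$ of mixer stage 2. Let us start from (4.149) and set $h(v+u,u)$ to $h_{LTI}(v)\,\mathrm{gain}(u)$. Hence

$$\begin{aligned}
H_n(\omega_{if}) &= \frac{1}{T}\int_{0}^{T}\left[\int_{-\infty}^{\infty} h_{LTI}(v)\,\mathrm{gain}(u)\,e^{-j\omega_{if}v}\,dv\right] e^{jn\omega_{lo}u}\,du \\
&= \left[\int_{-\infty}^{\infty} h_{LTI}(v)\,e^{-j\omega_{if}v}\,dv\right]\cdot\frac{1}{T}\int_{0}^{T}\mathrm{gain}(u)\,e^{jn\omega_{lo}u}\,du \\
&= H_{LTI}(\omega_{if})\cdot\frac{1}{T}\int_{0}^{T}\mathrm{gain}(u)\,e^{jn\omega_{lo}u}\,du
\end{aligned} \tag{4.151}$$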


Remember that gain(u) is proportional to $S[A_{lo}\cos\omega_{lo}u]$. What is $S[A_{lo}\cos\omega_{lo}u]$?

We have stated before that the characteristics of mixer stage 2 of the unbalanced mixer are described by Figure 4.2. If we apply $A_{lo}\cos(\omega_{lo}u)$ to the LO port of mixer stage 2 of this unbalanced mixer, $S[A_{lo}\cos\omega_{lo}u]$ can be obtained from Figure 4.2. Specifically, if

$$A_{lo} \gg V_+ \quad\text{and}\quad A_{lo} \gg V_- \tag{4.152}$$


then from Figure 4.2 mixer stage 2 is seen to behave in such a way that for almost 50% of the time $M_1$ in Figure 4.3 is turned off, and the gain is 0; for almost the other 50% of the time, $M_1$ is fully on, and the gain is at its maximum value, gain_max. Therefore, gain(u) varies as a square wave, as shown in Figure 4.24. Now we want to verify that gain_max is 2, as depicted in Figure 4.24. This is done by applying (4.151) to the case when the mixer is not switching. Conceptually, nonswitching means that $H_n = H_{LTI}$, so the left-hand side of (4.151) becomes $H_{LTI}$. How about the right-hand side? For the nonswitching case, conceptually $\omega_{lo} = 0$, so the right-hand side becomes $H_{LTI}\cdot\frac{1}{T}\int_{0}^{T}\mathrm{gain}(u)\,du$. Applying the gain(u) shown in Figure 4.24 to this expression, it becomes $H_{LTI}\cdot\frac{\mathrm{gain\_max}}{2}$. Hence (4.151) becomes $H_{LTI} = H_{LTI}\cdot\frac{\mathrm{gain\_max}}{2}$. Solving,

$$\mathrm{gain\_max} = 2 \tag{4.153}$$


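Using this value of gain_max, gain(u) can be written as:

$$\mathrm{gain}(u) = 2\,\Pi(u) + 1 \tag{4.154}$$

where $\Pi(u) = 1/2$ for $u$ between $-T/4$ and $T/4$, and $\Pi(u) = -1/2$ for $u$ between $T/4$ and $3T/4$. Substituting (4.154) into (4.151) and solving [4], we have

$$H_n = H_{LTI}\,\mathrm{sinc}\!\left(\frac{n\pi}{2}\right), \qquad \mathrm{sinc}(x) = \frac{\sin x}{x} \tag{4.155}$$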
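For readers who want the intermediate steps, the integral can be worked out as follows. This is a sketch that assumes the unnormalized convention sinc(x) = sin(x)/x, which is the convention consistent with the value 2/π quoted for n = 1, and it integrates over the period from −T/4 to 3T/4, which is equivalent to 0 to T because gain(u) is periodic.

```latex
\begin{aligned}
\frac{H_n}{H_{LTI}}
  &= \frac{1}{T}\int_{-T/4}^{3T/4} \mathrm{gain}(u)\, e^{jn\omega_{lo}u}\, du
   = \frac{1}{T}\int_{-T/4}^{T/4} 2\, e^{jn\omega_{lo}u}\, du \\
  &= \frac{2}{T}\cdot\frac{2\sin\!\left(n\omega_{lo}T/4\right)}{n\omega_{lo}}
   = \frac{\sin(n\pi/2)}{n\pi/2}
   = \mathrm{sinc}\!\left(\frac{n\pi}{2}\right),
   \qquad \omega_{lo}T = 2\pi .
\end{aligned}
```

The second equality uses the fact that gain(u) is 2 for u between −T/4 and T/4 and 0 for the rest of the period.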

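In summary, we have determined $H_n$ for mixer stage 2 under the special $V_{lo}$ switching case.

Looking at (4.155), we can see that $H_n$ goes down rather rapidly as $n$ increases.

As an example, when $n = 0$, $H_0 = H_{LTI}$. When $n = 1$, we have $\mathrm{sinc}\!\left(\frac{\pi}{2}\right) = \frac{\sin(\pi/2)}{\pi/2} = \frac{2}{\pi} \approx 0.63$.

Accordingly, $H_1 \approx 0.63\,H_{LTI}$, or $\frac{H_1}{H_{LTI}} = \frac{2}{\pi} \approx 0.63$. Therefore,

$$H_0 = H_{LTI}, \qquad |H_1| \approx 0.63\,|H_{LTI}| \tag{4.156}$$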
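The values above can be cross-checked numerically. The sketch below integrates the square-wave gain(u) of Figure 4.24 over one period with a midpoint rule and compares the result with sin(nπ/2)/(nπ/2); function names and sample counts are illustrative choices. It also sums |H_n/H_LTI|² over many n, which, under the added assumptions of a white $S_{n2}$ and no further band-limiting, indicates how much the folded noise in (4.134) would exceed the LTI prediction.

```python
import cmath
import math

T = 1.0
W_LO = 2 * math.pi / T


def gain(u: float) -> float:
    """Square wave of Figure 4.24: 2 while the switch is on, 0 while it is off."""
    phase = (u / T + 0.25) % 1.0   # phase in [0, 0.5) corresponds to -T/4 <= u < T/4
    return 2.0 if phase < 0.5 else 0.0


def ratio_numeric(n: int, samples: int = 20000) -> complex:
    """(1/T) * integral of gain(u) exp(j n w_lo u) over one period (midpoint rule)."""
    du = T / samples
    total = 0j
    for k in range(samples):
        u = (k + 0.5) * du - T / 4
        total += gain(u) * cmath.exp(1j * n * W_LO * u)
    return total * du / T


def ratio_closed_form(n: int) -> float:
    """H_n / H_LTI = sin(n*pi/2) / (n*pi/2), with the n = 0 limit equal to 1."""
    if n == 0:
        return 1.0
    x = n * math.pi / 2
    return math.sin(x) / x


# Sum of |H_n / H_LTI|^2 over a large but finite range of n.
folding_sum = sum(ratio_closed_form(n) ** 2 for n in range(-2001, 2002))
```

The partial sum approaches 2, so under these assumptions the folded output noise would be about twice the n = 0 term alone, in line with the "close to 100%" degradation mentioned earlier.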


4.7.3    Analysis of Noise in Unbalanced Mixer ($V_{lo}$ Not Switching and Special $V_{lo}$ Switching Cases)

Assume that we are investigating the unbalanced mixer shown in Figure 4.3. Furthermore, assume that this unbalanced mixer operates at

RF = 1.9 GHz, BW = 1.7 MHz

LO frequency = 1.8 GHz, IF = 100 MHz

and $V_{lo} = A_{lo}\cos\omega_{lo}u$. Hence its average value is 0. $A_{lo}$ is set to be much larger than $V_+$ and $V_-$.

The mixer's V-I converter $M_3$ has $V_{GS3} - V_t = 5.1$ V; switch $M_{1-2}$ has an overdrive voltage $V_{GS1} - V_t = 1$ V.

$M_{1-2}$ has a $W/L$ of 50 µm/0.6 µm; $M_3$ has a $W/L$ of 3.8 µm/0.6 µm.

The unbalanced mixer is built with a 0.6 µm CMOS technology that has the following characteristics:

$k' = 75$ µA/V²

$C_{ox} = 2$ fF/µm²

In this sub-section we are interested in finding this mixer's NF.

Let us represent this unbalanced mixer as shown in Figure 4.17. Again we assume that all the noise comes from stage 1 and neglect any noise contribution from stage 2. We further divide our discussion into two scenarios:

1. Mixer stage 2 is operating under the $V_{lo}$ not switching case.

2. Mixer stage 2 is operating under the special $V_{lo}$ switching case.

The calculation of NF is divided into three steps.


4.7.3.1     Step 1: Calculate $S_{n2}$

In step 1 we calculate the PSD of the noise from stage 1, because for both scenarios 1 and 2 the primary noise source is this stage. Stage 1 consists of a V-I converter, so let us go through a simple example of the noise PSD calculation for this V-I converter.

From (4.133), the PSD of this V-I converter's input referred voltage noise is given by

$$S_{n1} = \frac{4kT\cdot\frac{2}{3}}{g_{m3}} \tag{4.157}$$

We are given that $(W/L)_3$ is 3.8 µm/0.6 µm. We are also given $V_{GS3} - V_t = 5.1$ V and $k' = 75$ µA/V². Hence

$$g_{m3} = k'(W/L)_3(V_{GS3} - V_t) = 2.4\ \mathrm{m\Omega^{-1}} \tag{4.158}$$
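Substituting (4.158) into (4.157) and taking the square root, we have

$$\sqrt{S_{n1}} = \sqrt{\frac{4kT\cdot\frac{2}{3}}{g_{m3}}} = \sqrt{\frac{4\times 4\times 10^{-21}\times 2}{3\times 2.4\times 10^{-3}}}\ \frac{\mathrm{V}}{\sqrt{\mathrm{Hz}}} = \frac{2.1\ \mathrm{nV}}{\sqrt{\mathrm{Hz}}} \tag{4.159}$$

Referring to Figure 4.17, we have

$$S_{n2} = S_{n1}\times g_{m3}^2 = \left(\frac{2.1\ \mathrm{nV}}{\sqrt{\mathrm{Hz}}}\right)^2 \times \left(2.4\ \mathrm{m\Omega^{-1}}\right)^2 = \frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} \tag{4.160}$$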
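The arithmetic of step 1 can be reproduced with a short script. This is a sketch under two assumptions: that the input referred noise PSD of (4.157) has the long-channel form 4kT(2/3)/g_m3 (the form that reproduces the 2.1 nV/√Hz quoted in (4.159)), and that kT is rounded to 4 × 10⁻²¹ J as in the NF calculation of this section. The variable names are illustrative.

```python
import math

KT = 4e-21               # J, rounded value of kT used in this section
K_PRIME = 75e-6          # A/V^2, process transconductance parameter
W_OVER_L_3 = 3.8 / 0.6   # aspect ratio of M3
V_OV_3 = 5.1             # V, overdrive voltage of M3 as given

g_m3 = K_PRIME * W_OVER_L_3 * V_OV_3       # (4.158), in A/V
s_n1 = 4 * KT * (2.0 / 3.0) / g_m3         # assumed form of (4.157), in V^2/Hz
s_n2 = s_n1 * g_m3 ** 2                    # (4.160), in A^2/Hz

v_n1 = math.sqrt(s_n1)   # input referred noise voltage density, V/sqrt(Hz)
i_n2 = math.sqrt(s_n2)   # noise current density at node 2, A/sqrt(Hz)
```

The results land close to the rounded values in the text: about 2.4 mΩ⁻¹ for g_m3, about 2.1 nV/√Hz for the input referred noise, and about 5 pA/√Hz at node 2.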


4.7.3.2     Step 2: Find $H_{LTI}$

In step 2 we first need the average value at the LO port in order to calculate $H_{LTI}$ for both scenarios (1) and (2). Fortunately this average value is given at the beginning of sub-section 4.7.3 to be 0, so there is no need to calculate it.

With this average value, $H_{LTI}$ is the same for both scenarios (1) and (2) and can be found by applying a constant 0 V at the LO port. If we do that and examine $M_{1-2}$ in Figure 4.3, we find that $M_{1-2}$ is an SCP that is balanced. Using the half-circuit concept, each transistor operates like a common gate (CG) stage, with current input and current output. The magnitude of the low-frequency gain is 1/2, since the current splits evenly between the two branches and the output branch gets only 1/2 of the input current, and so

$$|H_{LTI}(0)| = \frac{1}{2} \tag{4.161}$$

Also, for a CG stage with current input and output, there is one dominant pole whose frequency is given approximately by

$$\omega_p \approx \frac{g_{m1}}{C_{gs1}} \tag{4.162}$$

Therefore  the  -3dB  frequency  (/_3db)  of  HLti  is  the  same  as  this  pole  frequency. 
To  calculate  numerically  this  pole  frequency,  we  first  find  Cgsi 

CgA  =  WxLxCox  (4.163) 


We  are  given  a  (W/L)]  of  50u/0.6u  and  a  Cox  of  2fF/um2.  Substituting  we  have 
Cgsl  =  50um  x  0.6um  x  IfFjum1  =  60 fF  (4.164) 
Fig. 4.25 Mixer stage 2 is represented as a special LPTV system. Frequency response of the LTI
system  part  of  this  special  LPTV  system  is  shown 


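Next we calculate $g_{m1}$. We are given an overdrive voltage $V_{GS1} - V_t$ of 1 V, $(W/L)_1$
of 50 µm/0.6 µm and $k' = 75\ \mu\mathrm{A/V^2}$. Hence

$$g_{m1} = k'(W/L)_1(V_{GS1} - V_t) = 6.2\ \mathrm{m\Omega^{-1}} \tag{4.165}$$

Substituting (4.164), (4.165) in (4.162), we finally find the numerical value of
the pole frequency

$$\omega_p = \frac{g_{m1}}{C_{gs1}} \approx 100\ \mathrm{Grad/s} \quad \text{or} \quad f_p \approx 16.5\ \mathrm{GHz} \tag{4.166}$$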
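The pole calculation in (4.162) to (4.166) is easy to reproduce. The sketch below uses only the values given in the text; the names are illustrative, and the 1.8 GHz LO frequency is the value quoted later in this sub-section.

```python
import math

K_PRIME = 75e-6            # k' in A/V^2
C_OX = 2e-15 / 1e-12       # 2 fF/um^2 expressed in F/m^2
W, L = 50e-6, 0.6e-6       # dimensions of M1 in metres
V_OV_1 = 1.0               # V_GS1 - V_t in volts
F_LO = 1.8e9               # LO frequency in Hz


def cg_stage_pole():
    """Return (C_gs1, g_m1, omega_p, f_p) for the common-gate half circuit."""
    c_gs1 = W * L * C_OX                 # (4.163)
    g_m1 = K_PRIME * (W / L) * V_OV_1    # (4.165)
    omega_p = g_m1 / c_gs1               # (4.162)
    return c_gs1, g_m1, omega_p, omega_p / (2 * math.pi)


c_gs1, g_m1, omega_p, f_p = cg_stage_pole()
print(f"C_gs1 = {c_gs1 * 1e15:.0f} fF, g_m1 = {g_m1 * 1e3:.2f} mA/V")
print(f"omega_p = {omega_p / 1e9:.0f} Grad/s, f_p = {f_p / 1e9:.1f} GHz")
print(f"f_p / f_lo = {f_p / F_LO:.1f}")
```

The last line shows how far the pole sits above the LO frequency, which is the comparison needed when checking condition (4.150).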


Using (4.161) and (4.166) we can obtain the Bode plot for the magnitude response of
$H_{LTI}$, as shown in Figure 4.25.


4.7.3.3     Step  3:  Find  NF 

In step 3 we calculate the NF for both scenarios: the $V_{lo}$ not switching case and the
special $V_{lo}$ switching case.


Scenario (1): $V_{lo}$ not switching case

Mixer stage 2 can be treated as an LTI circuit whose frequency response is the same
as that of $H_{LTI}$, which is shown in Figure 4.25. Hence we have

$$S_{n3} = |H_{LTI}(100\ \mathrm{MHz})|^2\,S_{n2} \tag{4.167}$$

From Figure 4.25, $H_{LTI}(100\ \mathrm{MHz}) = H_{LTI}(0)$, which in turn is given in (4.161)
to be 1/2. From (4.160), $S_{n2}(100\ \mathrm{MHz}) = (5\ \mathrm{pA})^2/\mathrm{Hz}$. Substituting in (4.167) we have

$$S_{n3} = 0.25 \times \frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} \tag{4.168}$$

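Now what is the NF of the mixer for this scenario, assuming a 50 Ω $R_s$? $N_{dev}$, the
input referred noise of the mixer, is given by

$$N_{dev} = \frac{S_{n3}}{(H_{LTI}(100\ \mathrm{MHz}) \times g_{m3})^2} = \frac{0.25 \times (5\ \mathrm{pA})^2/\mathrm{Hz}}{(0.5 \times g_{m3})^2} = \frac{0.25 \times (5\ \mathrm{pA})^2/\mathrm{Hz}}{(0.5 \times 2.4\ \mathrm{m\Omega^{-1}})^2} \tag{4.169}$$

Using (2.67) and noting that $N_{dev}$ in (4.169) equals $N_{Device}/G$ in (2.67), we have

$$NF = 1 + \frac{N_{dev}}{N_{source\text{-}resistance}} = 1 + \frac{N_{dev}}{4kTR_s} = 1 + \left(\frac{0.25 \times (5\ \mathrm{pA})^2/\mathrm{Hz}}{(0.5 \times 2.4\ \mathrm{m\Omega^{-1}})^2}\right) \times \frac{1}{4 \times 4 \times 10^{-21} \times 50\ \Omega/\mathrm{Hz}} = 1 + 5.625 = 6.625 = 8.2\ \mathrm{dB} \tag{4.170}$$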

This  is  slightly  better  than  the  12  dB  of  NF  assigned  to  this  mixer  in  Chapter  2 
(see  Table  2.1,  row  4,  column  5). 


Scenario (2): Special $V_{lo}$ switching case

Let us refer to Figure 4.17 again, where mixer stage 2 is an LPTV system with $S_{n2}$ as
the input noise PSD and $S_{n3}$ as the output noise PSD. Using (4.134), we have

$$S_{n3}(\omega_{if}) = \sum_{n=-\infty}^{\infty} |H_n(\omega_{if})|^2\,S_{n2}(\omega_{if} + n\omega_{lo}) = |H_0(\omega_{if})|^2 S_{n2}(\omega_{if}) + |H_1(\omega_{if})|^2 S_{n2}(\omega_{if} + \omega_{lo}) + |H_{-1}(\omega_{if})|^2 S_{n2}(\omega_{if} - \omega_{lo}) + \cdots \tag{4.171}$$

In scenario (1) we derived an LTI system from this LPTV system by applying
a constant signal to its LO port. Its transfer function has a frequency response as
given in Figure 4.25. Since this LTI system has only one pole, its $f_{min\_pole\_zero}$ is
given by that pole frequency. This was calculated in (4.166) to be 16.5 GHz.
Substituting this value and the given $f_{lo}$ of 1.8 GHz in (4.150), we see that (4.150) is
justified ($f_{min\_pole\_zero}$ is about 10 times higher than $f_{lo}$). Hence mixer stage 2 is
operating under the special $V_{lo}$ switching case (sub-section 4.7.2.2) and therefore we
can find $H_n$ by using (4.151).

In addition, we are given at the beginning of sub-section 4.7.3 that $A_{lo} \gg V^+$ and
$A_{lo} \gg V^-$ and hence (4.152) is satisfied. Then (4.155) applies, which is repeated for
reference:

$$H_n = H_{LTI}\,\mathrm{sinc}\!\left(\frac{n\pi}{2}\right) \tag{4.172}$$

It can be seen that $H_n$ goes down rapidly as n increases, so as a simplification we
include only the $H_0$, $H_1$ and $H_{-1}$ terms and ignore higher order terms in (4.171).
$H_0$, $H_1$ and $H_{-1}$ can be obtained from (4.156), repeated here for reference:

$$H_0 = H_{LTI} \qquad |H_1| \approx 0.63\,H_{LTI} \qquad |H_{-1}| \approx 0.63\,H_{LTI} \tag{4.173}$$

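Substituting (4.173) and (4.160) into (4.171) and ignoring higher order terms, we can then
find $S_{n3}$ at 100 MHz as follows:

$$S_{n3} = |H_{LTI}(100\ \mathrm{MHz})|^2\,\frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} + |0.63\,H_{LTI}(100\ \mathrm{MHz})|^2\,\frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} \times 2 \tag{4.174}$$

We have calculated in scenario (1) that $H_{LTI}(100\ \mathrm{MHz}) = 1/2$. Substituting in
(4.174), we have

$$S_{n3} = 0.25 \times \frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} + 0.2 \times \frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} = 0.45 \times \frac{(5\ \mathrm{pA})^2}{\mathrm{Hz}} \tag{4.175}$$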

Compared to (4.168), one can see that including the periodic time varying effect
introduced in the special $V_{lo}$ switching case actually increases the noise by about
80%. Of course, if we have a significant noise contribution at the frequencies from which
$H_1$ and $H_{-1}$ alias noise into the IF band (in this case 1.9 GHz and −1.7 GHz), then we can see
from (4.171) that including the periodic time varying effect is crucial. Hence under
those circumstances where there is significant wideband noise (not filtered out by
any previous filters), adopting the noise calculation incorporating the periodic time
varying effect is particularly useful.
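The size of the folding effect can be explored numerically. The sketch below assumes that the harmonic transfer functions follow $H_n = H_{LTI}\,\mathrm{sinc}(n\pi/2)$ with $\mathrm{sinc}(x) = \sin x / x$, which reproduces the 0.63 factor of (4.173), and that the input noise PSD is the same at every sideband (white noise). Both are assumptions made for illustration; the function names are hypothetical.

```python
import math


def sinc(x):
    """Unnormalized sinc: sin(x)/x with sinc(0) = 1."""
    return 1.0 if x == 0 else math.sin(x) / x


def folded_noise_factor(h_lti, n_max):
    """Sum of |H_n|^2 for n = -n_max..n_max, i.e. S_n3 / S_n2 for white input noise."""
    return sum((h_lti * sinc(n * math.pi / 2)) ** 2 for n in range(-n_max, n_max + 1))


no_folding = folded_noise_factor(0.5, 0)          # LTI case: only H_0
first_sidebands = folded_noise_factor(0.5, 1)     # H_0, H_1 and H_-1
many_sidebands = folded_noise_factor(0.5, 2001)   # approaches the untruncated sum

print(f"H_0 only:        {no_folding:.4f}")
print(f"H_0, H_1, H_-1:  {first_sidebands:.4f}")
print(f"|n| <= 2001:     {many_sidebands:.4f}")
print(f"increase from first sidebands: {100 * (first_sidebands / no_folding - 1):.0f}%")
```

Under these assumptions the first pair of sidebands accounts for most of the folded noise, which is why truncating (4.171) after $H_{\pm 1}$ is a reasonable simplification when the noise is flat.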

Finally, what is the NF of the mixer for this scenario? (We assume that the
concept of equivalent input noise, already derived for an LTI system, carries
over to the LPTV system and therefore the concept of NF carries over as well.
The conversion gain is assumed to be the same as the conversion gain in the
LTI case.) $N_{dev}$, the input referred noise, is given by

$$N_{dev} = \frac{S_{n3}}{(H_{LTI}(100\ \mathrm{MHz}) \times g_{m3})^2} = \frac{0.45 \times (5\ \mathrm{pA})^2/\mathrm{Hz}}{(0.5 \times g_{m3})^2} = \frac{0.45 \times (5\ \mathrm{pA})^2/\mathrm{Hz}}{(0.5 \times 2.4\ \mathrm{m\Omega^{-1}})^2} \tag{4.176}$$

Hence

$$NF = 1 + \frac{N_{dev}}{N_{source\text{-}resistance}} = 1 + \frac{N_{dev}}{4kTR_s} = 1 + \left(\frac{0.45 \times (5\ \mathrm{pA})^2/\mathrm{Hz}}{(0.5 \times 2.4\ \mathrm{m\Omega^{-1}})^2}\right) \times \frac{1}{4 \times 4 \times 10^{-21} \times 50\ \Omega/\mathrm{Hz}} = 1 + 10.12 = 11.12 = 10.5\ \mathrm{dB} \tag{4.177}$$

This is slightly better than the 12 dB of NF assigned to this mixer in Chapter 2.
We will show (in problem 4.13) that if there is a strong noise component at 1.9 GHz,
NF will become substantially worse.
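The two NF results can be compared side by side with a small helper. This is a sketch using the rounded inputs quoted in the text (5 pA/√Hz, 2.4 mΩ⁻¹, $kT = 4 \times 10^{-21}$ J, 50 Ω); the helper name and its single-parameter interface are illustrative choices, not part of the original derivation.

```python
import math

K_T = 4e-21            # kT in joules (rounded value used in the text)
R_S = 50.0             # source resistance in ohms
G_M3 = 2.4e-3          # transconductance of the V-I converter in A/V
H_LTI = 0.5            # |H_LTI(100 MHz)|
S_N2 = (5e-12) ** 2    # input noise PSD of mixer stage 2 in A^2/Hz


def noise_figure_db(output_noise_factor):
    """NF in dB when S_n3 = output_noise_factor * S_n2."""
    s_n3 = output_noise_factor * S_N2
    n_dev = s_n3 / (H_LTI * G_M3) ** 2           # input-referred noise in V^2/Hz
    noise_factor = 1 + n_dev / (4 * K_T * R_S)   # ratio to source-resistance noise
    return 10 * math.log10(noise_factor)


nf_not_switching = noise_figure_db(0.25)   # scenario (1)
nf_switching = noise_figure_db(0.45)       # scenario (2)
print(f"scenario (1): NF = {nf_not_switching:.1f} dB")
print(f"scenario (2): NF = {nf_switching:.1f} dB")
```

Because the inputs are rounded, the script lands within a few tenths of a dB of the values in (4.170) and (4.177); the roughly 2 dB gap between the two scenarios is the part that matters.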
Let us take Figure 4.6 and add the proper biasing circuitry and other auxiliary circuits
and we get Figure 4.26, where $M_1$, $M_2$, $M_{17}$, $M_3$-$M_6$ form the basic quad pair (or
Gilbert mixer). Cascode devices $M_7$, $M_8$ are inserted between $M_{3-6}$ and $M_{1-2}$ to
provide isolation between the LO and RF ports. $M_9$-$M_{10}$ form the current sources.
Together with resistors formed by biasing $M_{T1}$-$M_{T2}$ in the triode region they are used
as loads. $M_{T1}$-$M_{T2}$ usually have resistance values much smaller than the resistance
values of $M_9$-$M_{10}$. To bias $M_{T1}$-$M_{T2}$ in the triode region, we use $I_{gain}$, $M_{14}$ and $V_{bias1}$ to
set $V_{G14}$, which is in turn used to bias the gate of $M_{T1}$-$M_{T2}$. If $I_{gain}$ increases, $V_{G14}$
decreases, then $V_{GS,MT1}$, $V_{GS,MT2}$ increase and the resistor's resistance decreases. The
conversion gain, which equals $\frac{2}{\pi} \times g_{M1} \times (R_{MT1} \,\|\, R_{M9}) \approx \frac{2}{\pi} \times g_{M1} \times R_{MT1}$, increases.
These two resistors can sometimes be formed by P-diffusion. $M_{T1}$-$M_{T2}$ are also used
in the common mode feedback (CMFB) loop formed from $M_{12}$, $M_{13}$, $M_{11}$, $M_9$, $M_{10}$.
When $V_{out}^+$ increases and $V_{out}^-$ decreases, $V_{center}$ stays constant and so the loop does not
respond to a differential mode output change. If, due to mismatch between the top and
bottom transistors (e.g. $M_9$ and $M_1$ or $M_{10}$ and $M_2$), both $V_{out}^+$ and $V_{out}^-$ increase, then
$V_{center}$ increases by the same amount. This steers the current $I_{CM}$ into $M_{13}$ and away
from $M_{12}$, leading to a corresponding reduction of the current in $M_{11}$. Hence there is a
reduction of $V_{GS11}$ and hence $V_{GS9}$, $V_{GS10}$ decrease. Therefore, $M_9$, $M_{10}$ pump less
current into $M_{T1}$, $M_{T2}$ and $V_{out}^+$, $V_{out}^-$ decrease, thus opposing the original increase in
$V_{out}^+$, $V_{out}^-$. $V_{center}$ in steady state is set to $V_{bias1}$. $C_{comp}$ is the compensation capacitor
used to stabilize the CMFB loop.

Fig. 4.26 A complete active mixer


4.9     Problems 


4.1. Since the natures of the signals $V_{lo}$ and $V_{rf}$ are different, the mixers in Figure 4.4
and Figure 4.5 are not exactly the dual of one another. If one includes
parasitic effects, such as mismatch between $M_1$ and $M_2$, and parasitic
capacitance, such as $C_{gd}$, which mixer has larger feedthrough (from LO to IF)? How
about reradiation (from LO to RF)?

4.2.  Show  mathematically  that  the  mixer  output  current  shown  in  Figure  4.8  is 
balanced  and  does  not  contain  any  RF  feedthrough. 

4.3.  Comment  on  the  validity  of  equation  (4.9).  Under  what  circumstance  would 
the  equation  not  be  true? 

4.4. Show that from the equation $IIP_3|_{dBm} = P_i|_{dBm} - \frac{IM_3|_{dB}}{2}$ we can obtain the
equation $A_{IP3}^2 = A_{interference}^2 / IM_3$.

4.5. Repeat Numerical example 4.3, this time with an $IIP_3$ of −10 dBm and a
power of 5 mW.

4.6.  Repeat  cases  1  and  2  as  discussed  in  sub-section  4.6.1.3,  except  this  time  use 
sinusoidal  inputs.  Show  that  there  are  seven  terms  in  the  case  for  two  input 
sinusoids. 
4.7. We have shown that the shorthand notation $a_2\,Y(j\omega_a) \cdot Y(j\omega_b) \circ S_i^2$ does
generate terms for a = 1, b = 1 that agree with (4.60).

a. Repeat that for a = 1, b = 2 and a = 2, b = 2. Show the terms generated.

b. Repeat the case when there are three frequency components at the input:
$\omega_1$, $\omega_2$, $\omega_3$.

c. We are interested in the third-order term in (4.59), $a_3[Y \circ S_i]^3$. Suppose
there are three input frequencies: $\omega_1$, $\omega_2$, $\omega_3$. Show the terms generated for
all the cases corresponding to relevant assignments of a, b, c. Repeat for
two input frequencies.

4.8.  In  sub-section  4.6.2.4  we  isolated  the  first-order  term  by  working  on  (4.100) 
and  deriving  (4.101).  Try  to  isolate  the  first-order  term  from  (4.99)  instead. 
Show  and  justify  which  of  the  three  terms  in  (4.99)  do/do  not  contribute  to  the 
first-order  term  and  rederive  (4.101). 

4.9.  In  Numerical  example  4.4,  change  Vqsi  -  Vt  to  0.00387  V  while  keeping  the 
rest  of  the  parameters  the  same.  What  is  IM3?  Would  IM3  still  be  3HD3? 

4.10. After (4.118) we mention that $G_1$ in (4.116) (derived using Volterra series),
when normalized, agrees with $a_1$ in (4.23) (derived using Taylor series).
Verify this. What is the normalizing factor?

4.11.  This  problem  examines  some  subtleties  of  an  LPTV  system 

a.  For  an  LPTV  system,  let  us  launch  an  impulse  at  u  and  observe  the 
response  at  v.  Now  let  us  delay  by  T  and  observe  the  response  at  v  +  T, 
where  T  is  the  period  with  which  the  system  is  repeating  itself.  Would  we 
get  the  same  response? 

b.  Show  how  (4.147)  is  obtained  from  (4.142)  and  (4.146). 

4.12. At the end of Section 4.1 we mention that we can use $R_{on}$ of a MOS transistor
biased in the triode region as a variable gain amplifier and hence a mixer. This
mixer  will  be  treated  in  more  detail  in  Chapter  5.  However,  because  of 
its  simplicity,  we  can  use  it  to  illustrate  some  principles  of  an  LPTV  system. 
The  following  are  two  different  implementations  of  this  mixer:  one  case 
loaded  with  a  resistor  (Figure  P4.1)  and  the  other  case  with  a  capacitor 
(Figure P4.2). Assume that $V_{lo}$ is a square wave. Furthermore, assume
that this mixer has an $f_{min\_pole\_zero}$ that satisfies (4.150).

The mixer loaded with a resistor $R_{termination}$, together with the equivalent
network, is shown in Figure P4.1. Notice that in the equivalent network,
the resistor with an arrow is used to indicate a linear but time-varying
resistor. This is not to be confused with a nonlinear, time-invariant resistor.
What do you think h, the impulse response of this equivalent network, is?
How about $H_1/H_0$?

The mixer loaded with a capacitor C, together with the equivalent
network, is shown in Figure P4.2. What do you think h, the impulse response
of this equivalent network, is? How about $H_1/H_0$?
Fig. P4.2


4.13. Repeat the calculation in sub-section 4.7.3.3, scenario 2: special $V_{lo}$
switching case, except this time assume that there is a strong noise
component at 1.7 GHz and 1.9 GHz whose PSD around 1.7 GHz and 1.9 GHz is
given by (50 pA)²/Hz. Calculate the NF and show that it is significantly worse.
Chapter 5
Passive  Mixer 


5.1     Introduction 

In  this  chapter  we  focus  on  passive  mixers.  One  of  the  most  obvious  trade-offs 
between  an  active  and  a  passive  mixer  is  that  of  gain  versus  distortion.  Active 
mixers  provide  gain  and  dissipate  quiescent  power.  A  Gilbert  mixer  is  such  an 
example  that  achieves  gain  through  an  active  predriver  (the  V-I  converter).  This  V-I 
converter  is  highly  nonlinear  and  hence  the  Gilbert  mixer  distortion  performance  is 
worse.  Passive  mixers,  on  the  other  hand,  require  only  dynamic  power.  They  have  a 
conversion  gain  of  less  than  one  (conversion  loss)  but  can  achieve  excellent 
distortion  performance. 

An  active  mixer  is  typically  operated  in  the  continuous  time  domain.  The  Gilbert 
mixer  discussed  in  Chapter  4  is  a  good  example.  For  passive  mixers,  some 
structures  operate  in  the  continuous  time  domain  and  some  structures  operate  in 
the  sampled  data  domain.  A  passive  mixer  that  operates  in  the  continuous  time 
mode  is  denoted  as  a  switching  mixer.  On  the  other  hand  a  passive  mixer  that 
operates  in  sampled  data  domain  is  called  a  sampling  mixer.  Because  of  the 
sampling  nature,  a  sampling  mixer  can  also  operate  in  the  subsampling  mode. 
Structurally, a switching mixer is usually terminated with a resistor and may or
may not have a capacitor. In contrast, a sampling mixer is always terminated
with only a capacitor.

This  chapter  begins  by  studying  the  switching  mixer,  followed  by  the  sampling 
mixer.  Finally  to  understand  the  sampling  mixer,  another  viewpoint  is  presented  in 
the  appendix:  the  sampling  mixer  is  structurally  the  same  as  an  existing  circuit, 
called  the  sample  and  hold  (SAH).  This  SAH  will  be  used  in  Chapter  6,  as  part  of 
the  analog-to-digital  converter. 
5.2     Switching Mixer

5.2.1     Unbalanced  Switching  Mixer 

The principle of operation of a simple switching mixer is explained by referring to
Figure 5.1. When $V_{lo}$ is positive, $M_1$ is on and $V_{if} = V_{rf}$. When $V_{lo}$ is negative, $M_1$ is
off and $V_{if} = 0$. The output signal appears to be "chopped." Since the amplitude of $V_{lo}$
exceeds $V^+$ and $V^-$, the output signal can be mathematically described as the input
signal multiplied by the function S(t), where S(t) is described in Figure 1.2 of
Chapter 4. Given the mixer input signal

$$V_{rf}(t) = V_{RF}\cos\omega_{rf}t$$

the output voltage is

$$V_{if}(t) = V_{rf}(t)\,S(t) \tag{5.1}$$

$$= V_{RF}\cos\omega_{rf}t\left[\frac{1}{2} + \sum_{n=1}^{\infty}\frac{\sin(n\pi/2)}{n\pi/2}\cos(n\omega_{lo}t)\right] \tag{5.2}$$
Fig. 5.1 Unbalanced switching mixer

Fig. 5.2 Double balanced switching mixer

Notice that this equation is essentially the same as (1.2) of Chapter 4, with $G_0 = 1$.
As in Chapter 4, we use an appropriate bandpass filter to obtain the mixed-down
component at $\omega_{if} = \omega_{rf} - \omega_{lo}$, which we denote as $V_{if}$ again:


$$V_{if}(t) = \frac{1}{\pi}V_{RF}\cos(\omega_{rf} - \omega_{lo})t \tag{5.3}$$

Applying the definition of $G_c$ as given in Chapter 4 to (5.3), we have

$$G_c = \frac{1}{\pi} \tag{5.4}$$


Notice  (5.1)-(5.4)  bear  resemblance  to  (1.1)  through  (1.5)  of  Chapter  4.  This 
highlights  the  fact  that  the  switching  mixer  can  also  be  derived  from  the  variable 
gain  model  of  the  mixer  (Figure  1.1,  Chapter  4)  and  should  convince  readers  of  the 
wide  applicability  of  this  model. 

As  with  active  mixers,  switching  mixers  can  be  classified  using  the  balance 
concept.  Under  this  classification  the  simple  switching  mixer  of  Figure  5.1  is 
denoted  as  an  unbalanced  switching  mixer. 
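The 1/π conversion gain of the unbalanced switching mixer follows directly from the Fourier coefficients of the switching function. The sketch below assumes S(t) is an ideal 50% duty-cycle square wave that switches between 1 (switch on) and 0 (switch off); the function names are illustrative.

```python
import math


def square_wave_coefficient(n):
    """Coefficient of cos(n*w_lo*t) in the 0/1, 50% duty-cycle square wave S(t)."""
    if n == 0:
        return 0.5                      # average value of S(t)
    x = n * math.pi / 2
    return math.sin(x) / x              # same as 2/(n*pi) * sin(n*pi/2)


def conversion_gain():
    """IF amplitude per unit V_RF: cos(a)cos(b) = [cos(a - b) + cos(a + b)] / 2."""
    return square_wave_coefficient(1) / 2


harmonics = [round(square_wave_coefficient(n), 3) for n in range(6)]
print(f"coefficients of S(t) for n = 0..5: {harmonics}")
print(f"G_c = {conversion_gain():.4f}, 1/pi = {1 / math.pi:.4f}")
```

Only the n = 1 coefficient contributes at $\omega_{rf} - \omega_{lo}$; the product-to-sum identity splits it equally between the difference and sum frequencies, which is where the extra factor of 1/2 comes from.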


5.2.2     Single  and  Double  Balanced  Switching  Mixer 


A  single  balanced  switching  mixer  can  be  constructed  by  connecting  a  pair  of 
unbalanced  switching  mixers.  If,  in  turn,  we  connect  a  pair  of  single  balanced 
switching  mixers,  we  get  a  double  balanced  switching  mixer,  which  is  a  counterpart 
to the Gilbert mixer. The mixer shown in Figure 5.2 is a double balanced switching
mixer with the isolation between the three ports being achieved by means of
center-tapped transformers (not shown). The transformers are idealized signal
coupling networks creating differential versions of the input signals. Transformers are
commonly used because they are wideband, contribute little noise, and are linear.

Qualitatively, this is how the double balanced mixer works. Let us assume that a
square wave $V_{lo}$ is applied to the LO port. As shown in Figure 5.2, this $V_{lo}$ has two
phases, $\phi_1$ and $\phi_2$. On $\phi_1$ a positive voltage is applied to the gates of $M_1$ and $M_3$,
turning them on. At the same time a negative voltage is applied to the gates of $M_2$
and $M_4$, turning them off. One can then see that $V_{rf}^+$ is connected through $M_1$ to $V_{if}^+$
and $V_{rf}^-$ is connected through $M_3$ to $V_{if}^-$. Consequently, $V_{if} = V_{rf}$. On $\phi_2$ the
situation is reversed. A negative voltage is applied to the gates of $M_1$ and $M_3$,
turning them off, and a positive voltage is applied to the gates of $M_2$ and $M_4$, turning
them on. $V_{rf}^+$ is now routed through $M_2$ to $V_{if}^-$ and $V_{rf}^-$ is routed through $M_4$ to
$V_{if}^+$. Therefore, $V_{if} = -V_{rf}$. We can thus see that $V_{if}$ is switched between the
positive and negative value of $V_{rf}$ and a switching operation is realized. Notice that
this switching action is very similar to the switching action described in Figure 3.3
of Chapter 4. The major difference is that here voltage ($V_{rf}$), as opposed to current
($I_{rf}$), is being switched.
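A time-domain simulation makes the difference between the unbalanced and the double balanced switch visible. The sketch below models the switch as an ideal multiplier S(t) that equals +1 on phase 1 and either 0 (unbalanced) or −1 (double balanced) on phase 2. The 10 Hz and 9 Hz tones are scaled-down, hypothetical stand-ins for the RF and LO frequencies, and all names are illustrative.

```python
import math


def tone_amplitude(samples, freq_hz, sample_rate):
    """Amplitude of the sinusoidal component at freq_hz (single-bin DFT)."""
    re = im = 0.0
    for i, s in enumerate(samples):
        angle = 2 * math.pi * freq_hz * i / sample_rate
        re += s * math.cos(angle)
        im += s * math.sin(angle)
    return 2 * math.hypot(re, im) / len(samples)


def switched_output(low_level, f_rf, f_lo, sample_rate, n_samples):
    """Unit-amplitude V_rf(t) times S(t); S(t) is +1 on phase 1, low_level on phase 2."""
    out = []
    for i in range(n_samples):
        t = i / sample_rate
        s = 1.0 if math.cos(2 * math.pi * f_lo * t) > 0 else low_level
        out.append(math.cos(2 * math.pi * f_rf * t) * s)
    return out


F_RF, F_LO, FS, N = 10.0, 9.0, 100_000.0, 100_000   # one second of samples

unbalanced = switched_output(0.0, F_RF, F_LO, FS, N)
double_balanced = switched_output(-1.0, F_RF, F_LO, FS, N)

gain_unbalanced = tone_amplitude(unbalanced, F_RF - F_LO, FS)
gain_double = tone_amplitude(double_balanced, F_RF - F_LO, FS)
rf_leak_unbalanced = tone_amplitude(unbalanced, F_RF, FS)
rf_leak_double = tone_amplitude(double_balanced, F_RF, FS)

print(f"IF amplitude: unbalanced {gain_unbalanced:.3f}, double balanced {gain_double:.3f}")
print(f"RF at output: unbalanced {rf_leak_unbalanced:.3f}, double balanced {rf_leak_double:.3f}")
```

In this idealized model the double balanced switch doubles the IF amplitude and removes the RF component from the output, because its switching function has no DC term.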


5.2.3    Nonidealities 

The following are some of the nonidealities of switching mixers and their impact.

5.2.3.1     Nonlinearity 

Let us refer to Figure 5.1 again. Since the switch $M_1$ has finite resistance upon
closure, any nonlinearity associated with this resistance is the primary source of
distortion. Assuming that $M_1$ is on, at low frequency, similar to (5.10), Chapter 4,
the output signal in Figure 5.1 can likewise be expanded in terms of the input signal.
Using a Taylor series expansion,

$$V_{if}(t) = a_1V_{rf}(t) + a_2V_{rf}^2(t) + a_3V_{rf}^3(t) + \cdots \tag{5.5}$$

Now when $M_1$ is switching, the output is

$$V_{if}(t) = \left[a_1V_{rf}(t) + a_2V_{rf}^2(t) + a_3V_{rf}^3(t) + \cdots\right]S(t) \tag{5.6}$$

The coefficients of the series can then be used to calculate distortion in the mixer.
As with other mixer designs, the objective is to reduce $a_3$, which is the primary
coefficient that determines the third-order intermodulation. At high frequency,
distortion must instead be described by a Volterra series.
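To see how $a_3$ sets the third-order intermodulation, a two-tone test can be run on the memoryless polynomial of (5.5) with the switch held on. The coefficients and tone amplitude below are hypothetical, and the comparison value $(3/4)\,|a_3/a_1|\,A^2$ is the standard small-signal result for a memoryless cubic nonlinearity driven by two equal tones of amplitude A.

```python
import math


def tone_amplitude(samples, cycles):
    """Amplitude of the component that completes `cycles` periods in the record."""
    n = len(samples)
    re = sum(s * math.cos(2 * math.pi * cycles * i / n) for i, s in enumerate(samples))
    im = sum(s * math.sin(2 * math.pi * cycles * i / n) for i, s in enumerate(samples))
    return 2 * math.hypot(re, im) / n


def two_tone_im3(a1, a2, a3, amplitude, f1=10, f2=11, n=1000):
    """Return (IM3 ratio, IM3 product amplitude) for y = a1*x + a2*x^2 + a3*x^3."""
    x = [amplitude * (math.cos(2 * math.pi * f1 * i / n) + math.cos(2 * math.pi * f2 * i / n))
         for i in range(n)]
    y = [a1 * v + a2 * v ** 2 + a3 * v ** 3 for v in x]
    fundamental = tone_amplitude(y, f1)
    product = tone_amplitude(y, 2 * f1 - f2)    # third-order product at 2*f1 - f2
    return product / fundamental, product


A1, A2, A3, AMP = 1.0, 0.1, -0.01, 0.1          # hypothetical values
im3, product = two_tone_im3(A1, A2, A3, AMP)
predicted = 0.75 * abs(A3 / A1) * AMP ** 2
print(f"simulated IM3 = {im3:.3e}, small-signal estimate = {predicted:.3e}")
```

The even-order coefficient $a_2$ does not contribute at $2f_1 - f_2$, so changing it leaves the result unchanged; only $a_3$ (relative to $a_1$) moves the intermodulation ratio.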
5.2.3.2     Feedthrough

As with active mixers, under ideal conditions, feedthroughs from the RF to the IF port and
from the LO to the IF port in a double balanced switching mixer are zero. In practice, around
−40 dB of feedthrough occurs. In switching mixers, there are mainly two mechanisms
that result in feedthrough. The first one is capacitive feedthrough. This is particularly
acute in MOS transistors, where parasitic capacitance is significant. One major
coupling path is through the gate to source/drain capacitors. The second mechanism
is due to transistor mismatches. Mismatches can come from W/L ratios, $V_T$, and other
process parameters, as well as the actual physical layout. As the mismatch between
one transistor and its "balanced" counterpart increases, the even-order coefficients
[$a_2$ in (5.5)] differ and do not completely cancel, similar to what happens in any
fully differential circuit.


5.2.3.3     Finite  Bandwidth 

Ideally, the speed with which the transistor can be turned off depends only on how
fast the channel charge can be released, and this sets an upper limit on the LO frequency.
The relevant time turns out to be the transit time, which is proportional to the square of
the channel length. Hence for a given technology (channel length), the maximum frequency of
operation is dictated.

If  the  frequency  of  operation  gets  close  to  this  maximum  frequency,  the  mixer 
suffers  from  the  tracking  error  due  to  finite  bandwidth  imposed  by  the  RC  time 
constant  of  the  switch  and  the  holding  capacitance.  This  error  translates  into  a 
reduction  in  conversion  gain. 

For a given technology the use of a small R to improve the bandwidth leads to a
large gate capacitance, resulting in an increased rise time of the LO waveform and
hence distortion. The rise time is dictated by the gate capacitance of the switch, the
voltage swing of the LO signal, and the driving capability of the clock or LO drivers
(although this driving capability is also a function of channel length, in an indirect
sense). Intermodulation distortion and conversion gain both degrade rapidly with
increased rise time, due to the dependence of the transistor's nonlinear resistance on the
gate drive voltage. This, in conjunction with the dependence of the transistor's nonlinear
resistance on the changing $V_{rf}$, is the largest contribution toward distortion. We
should also note that for a given technology a small R leads to a large W/L, and so
the associated $C_{sb}$ and $C_{db}$, the nonlinear junction capacitors, have a higher contribution
to distortion. By using short channel length technologies one can reduce R while
keeping both the gate and junction capacitance small. This will allow for higher LO
frequencies while satisfying the intermodulation distortion requirement.

Next we derive design equations that address some of the aforementioned issues.
Specifically, we develop the design equations that quantify the three design
parameters: conversion gain ($G_c$), distortion ($IIP_3$), and noise (NF). Furthermore,
both the case when $V_{lo}$ is switching and the case when it is not are covered.
5.3     Distortion in Unbalanced Switching Mixer

Referring to Figure 5.1, when $M_1$ is on it may be modeled as shown in Figure 5.3.
The transistor operates in the triode region and is replaced by a nonlinear resistor
$R_{on}$. This is the primary source of intermodulation distortion at low frequency.
Note that the arrow on the resistor symbol for $R_{on}$ now indicates nonlinearity, not
time variation. Other components of importance include the nonlinear source
and drain to bulk capacitors, $C_{sb}$ and $C_{db}$. They dominate the distortion performance
at higher frequency. The resistor R can come from the 50 Ω termination resistor.

To find the intermodulation distortion of this mixer, we make the following
assumptions on transistor $M_1$.


5.3.1    Assumptions  on  Model 

a. On mobility: When on, the transistors in the mixer operate in the triode region
with $V_{ds} = 0$. The tangential electric field is low and velocity saturation is
assumed negligible. The assumption of long channel theory is still reasonable.

b. On threshold voltage: In actuality, the threshold voltage $V_t$ is modulated by the
source-to-bulk voltage and will introduce additional nonlinearity. For the time
being the threshold voltage is assumed to be constant.

c. On capacitors: The capacitor coefficients are found from the expansions of the
junction and sidewall capacitance equations. For $C_{sb}$ we have
Fig. 5.3 Complete distortion model for the unbalanced switching mixer

Fig. 5.4 Low-frequency distortion model for the unbalanced switching mixer

where

$C_j$ = junction capacitance/area

$C_{jsw}$ = sidewall capacitance/length

$a_s$ = area of source

$p_s$ = perimeter of source

$V_{sb}$ = source to bulk voltage

$M_j$ = junction grading coefficient

$M_{jsw}$ = sidewall grading coefficient

$\phi$ = bulk potential
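For reference, a sketch of the standard junction depletion-capacitance expression that uses exactly the symbols listed above. This is an assumption based on the usual SPICE-style junction model, with $V_{sb}$ taken as the reverse bias across the source-bulk junction:

$$C_{sb} = \frac{C_j\,a_s}{\left(1 + \dfrac{V_{sb}}{\phi}\right)^{M_j}} + \frac{C_{jsw}\,p_s}{\left(1 + \dfrac{V_{sb}}{\phi}\right)^{M_{jsw}}}$$

The first term is the bottom-plate (area) contribution and the second is the sidewall (perimeter) contribution. Both depend on $V_{sb}$, which is what makes the capacitor nonlinear.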

Notice that this equation applies equally well to $C_{db}$, the drain to bulk capacitor,
provided that all the parameters are given for the drain node.

With the preceding assumptions it can be shown that at sufficiently low
frequencies the distortion is mainly determined by the nonlinear $R_{on}$ of $M_1$ and
that distortion due to $C_{sb}$ and $C_{db}$ is quite small for typical values of MOS model
parameters. Even though we have chosen to ignore body effect for simplicity in
analysis, this choice does not modify our method of distortion analysis or the
conclusions.

Since detailed analysis of the mixer in Figure 5.3 has been carried out in [1], only
the results are presented in this chapter. It would be relatively straightforward for
interested readers to apply the analysis in Sections 5 and 6, Chapter 4, to the present
case and reproduce this detailed analysis.


5.3.2    Low-Frequency  Case 

First we assume $V_{lo}$ is not switching and so the mixer is represented by an NLTI
system. Let us look at the behavior of this NLTI system at low frequency. At low
frequency the model in Figure 5.3 can be simplified to that in Figure 5.4, where we
neglect all reactive element contributions.
Therefore, we can use the equation for a MOS transistor biased in the triode region:

$$I_d = k(V_{gs} - V_t)V_{ds} - \frac{k}{2}V_{ds}^2 \tag{5.8}$$

where $k = k'(W/L)$, $k' = \mu C_{ox}$, and $V_t$ is the threshold voltage. Either the input or the
output side may be regarded as source. It can then be shown, by applying the Taylor
series expansion [1], that

$$IM_3 = \frac{3}{8}\,\frac{1}{k(V_{GS} - V_t)^3R}\,A_{rf}^2 \tag{5.9}$$

where $A_{rf}$ is the amplitude of the RF input to the mixer.

Note that for most receiver applications the undesirable $ID_3$ is not the one
generated by the RF signal. Rather it is the $ID_3$ that is generated by the interferers.
Hence the $IM_3$ that corresponds to this $ID_3$ is given by

$$IM_3 = \frac{3}{8}\,\frac{1}{k(V_{GS} - V_t)^3R}\,A_{interference}^2 \tag{5.10}$$

where $A_{interference}$ is the amplitude of the interference signal.

When comparing (5.10) to (4.32) of Chapter 4, notice the difference in
dependence on $V_{GS} - V_t$.

Now by definition $R_{on} = \dfrac{1}{\partial I_d/\partial V_{ds}}$. We can differentiate (5.8), apply the result to
this definition and we have

$$R_{on} = \frac{1}{k(v_{gs} - v_t - v_{ds})} \approx \frac{1}{k(V_{GS} - V_t - V_{DS})} \tag{5.11}$$

If we further assume $V_{DS}$ is small and therefore neglect the $V_{DS}$ term, then we have

$$R_{on} = \frac{1}{k(V_{GS} - V_t)} \implies k = \frac{1}{R_{on}(V_{GS} - V_t)} \tag{5.12}$$

In (5.10), $IM_3$ is seen to depend on k. In (5.12) k is seen to depend on $R_{on}$. Hence
$IM_3$ depends on $R_{on}$.
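The dependence can be made explicit with a short worked substitution. This sketch uses the low-frequency forms of (5.10) and (5.12) restated here, with $V_{DS}$ neglected:

$$IM_3 = \frac{3}{8}\,\frac{A_{interference}^2}{k(V_{GS}-V_t)^3R}, \qquad k = \frac{1}{R_{on}(V_{GS}-V_t)}$$

$$\Rightarrow\quad IM_3 = \frac{3}{8}\,\frac{R_{on}}{R}\,\frac{A_{interference}^2}{(V_{GS}-V_t)^2}$$

Under these assumptions, for a fixed overdrive voltage and termination R, $IM_3$ scales linearly with $R_{on}$, so halving the on-resistance halves the intermodulation ratio.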

Finally, we look at the case when $V_{lo}$ is switching. First let us assume that $V_{lo}$ is
an ideal square wave, which means it has zero rise time and fall time. The situation
can be handled by modifying the foregoing results through multiplying the output
by the Fourier series representation of the square wave. The effect is simply to shift
the frequency of the fundamental and the third-order products by $f_{lo}$. The amplitudes
are both reduced by the same factor, $1/\pi$, with $IM_3$ remaining unchanged.

When $V_{lo}$ has finite rise and fall time, at low frequency we may assume that
the rise/fall time takes up a small percentage of the total period. Then even with
finite rise/fall time the distortion formulae remain practically the same.


5.3.3    High-Frequency  Case 

First we assume that $V_{lo}$ is not switching and so the mixer is represented by an NLTI
system. Let us look at the behavior of this nonlinear system at high frequency.
At high frequency, the reactive elements coming from the junction capacitors
contribute to third-order distortion and Volterra series must be invoked. To include
capacitor effects we should go back to Figure 5.3. To simplify the analysis let
us arbitrarily assume for the time being that in Figure 5.3 the nonlinear capacitor
$C_{db}$ at the drain is replaced by a linear capacitor C ($C_{sb}$ at the source, of course, does
not matter, since it is driven by a voltage source $V_{rf}$). Furthermore, let us assume that

$$R \gg R_{on} \tag{5.13}$$

and

$$\left|\frac{1}{j\omega_{rf}C}\right| \gg R_{on} \quad \text{or} \quad \frac{1}{2\pi R_{on}C} \gg f_{rf} \tag{5.14}$$

Applying these simplifying assumptions to the model in Figure 5.3, an expression
for $IM_3$ due to interferers can be found by applying Volterra series analysis [1]:


$$IM_3 = \frac{3\sqrt{\dfrac{1}{R^2} + \omega_{rf}^2C^2}}{8k(V_{GS} - V_t)^3}\,A_{interference}^2 \tag{5.15}$$

Note that this reverts to (5.10) when $\omega_{rf} = 0$, as expected. $IM_3$ can also be related
to $IM_3|_{low\ frequency}$ via

$$IM_3 = IM_3|_{low\ frequency}\,\sqrt{1 + (\omega_{rf}RC)^2} \tag{5.16}$$

where $IM_3|_{low\ frequency}$ is the predicted third-order intermodulation at low frequency
as given in (5.10).

Finally, we look at the case when $V_{lo}$ is switching. Again, we first assume that
$V_{lo}$ is an ideal square wave, which means it has zero rise time and fall time. As in the
low-frequency case, the situation can be handled by modifying the foregoing results
through multiplying the output by the Fourier series representation of the square
wave. The effect of the switching is simply to shift the frequency of the fundamental
and the third-order products by $f_{lo}$. The amplitudes are both reduced by the
same factor $1/\pi$, so $IM_3$ remains unchanged.

Now let us look at the situation when $V_{lo}$ has finite rise and fall times. Since at
high frequency the rise/fall time no longer takes up a small percentage of the total
period, we may have to resort to a technique called the "variable state approach" [1]
to calculate the distortion. This is based on a two-dimensional Volterra series. For a
simple circuit like a switching mixer, one general observation can be obtained [1]:
$IM_3$ increases as the ratio $t_f/T$ increases, where $t_f$ is the fall time and $T$ is
the LO period.

Numerical example 5.1.

Suppose an $IIP_3$ (input third-order intercept point) of +35 dBm is required for a
mixer and we want to design this mixer to operate at low and high frequency ($f_{rf}$ =
1.9 GHz). We are given $R$ = 50 Ω, $V_{GS} - V_t$ = 4 V, $L$ = 1 µm, and $k'$ = 100 µA/V².
$C_{db}$ is assumed to be a linear capacitor with $C_{jsw}$ = 0 and $C_j$ = 2 fF/µm², with
its area $ad$ given by $ad = W \times$ drain_height. Drain_height is given to be 6 µm.

a. Low frequency case

For simplicity let us assume that $V_{lo}$ is not switching. Since the mixer is operating at
low frequency, we use (5.10) to design for the $W/L$. To use (5.10) we need to relate
$IM_3$ to $IIP_3$. This could have been done by applying (2.38) from Chapter 2.
Alternatively, we can simply apply the definition that when $P_{interference} = IIP_3$,

$$IM_3 = 1 \qquad (5.17)$$

Since we are given $IIP_3$ = 35 dBm, $P_{interference}$ = 35 dBm when condition
(5.17) applies. Hence when condition (5.17) applies, $A_{interference}$ is given by

$$+35\ \text{dBm} = 10 \log_{10} \frac{A_{interference}^2}{2 \times 50\ \Omega \times 10^{-3}\ \text{W}}$$

$$A_{interference} = 17.8\ \text{V} \qquad (5.18)$$

Substituting (5.17), (5.18), and other given parameters into (5.10), we have

$$1 = \frac{3}{8} \cdot \frac{(17.8\ \text{V})^2}{100\ \mu\text{A/V}^2 \times \frac{W}{L} \times (4\ \text{V})^3 \times 50\ \Omega} \qquad (5.19)$$

Fig. 5.5 Layout of switch $M_1$

Solving, we have

$$\frac{W}{L} = 371 \qquad (5.20)$$
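The arithmetic of the low-frequency case can be checked with a short Python sketch. It assumes that (5.10) has the form $IM_3 = 3A_{interference}^2 / (8 k' (W/L)(V_{GS} - V_t)^3 R)$, which is the form implied by the substitution in (5.19), and that the interferer is a sinusoid delivering $P = A^2/(2R)$ into $R$. The function names are hypothetical.

```python
import math

def interferer_amplitude(iip3_dbm, r_ohm):
    """Peak amplitude (V) of a sinusoid delivering iip3_dbm into r_ohm, using P = A^2 / (2R)."""
    p_watt = 1e-3 * 10.0 ** (iip3_dbm / 10.0)
    return math.sqrt(2.0 * r_ohm * p_watt)

def w_over_l_low_freq(a_int, k_prime, v_ov, r_ohm, im3=1.0):
    """Solve IM3 = 3 A^2 / (8 k' (W/L) v_ov^3 R) for W/L (assumed form of (5.10))."""
    return 3.0 * a_int ** 2 / (8.0 * k_prime * v_ov ** 3 * r_ohm * im3)

# Parameters given in Numerical example 5.1.
a_int = interferer_amplitude(35.0, 50.0)
ratio = w_over_l_low_freq(a_int, 100e-6, 4.0, 50.0)
print(f"A_interference = {a_int:.1f} V, W/L = {ratio:.0f}")
```

With the given parameters this reproduces the 17.8 V amplitude of (5.18) and the $W/L$ of (5.20).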


b. High frequency case

For simplicity let us assume that $V_{lo}$ is not switching. Since the mixer is operating at
high frequency we can use (5.15) to design for the $W/L$.

To use (5.15) let us first determine $C$, or $C_{db}$ of transistor $M_1$. This can be found
from (5.7), with source replaced by drain in all the parameters, which is rewritten as:

$$C_{db} = \frac{C_{jsw} \times pd}{\left(1 + \frac{V_{db}}{\phi}\right)^{M_{jsw}}} + \frac{C_j \times ad}{\left(1 + \frac{V_{db}}{\phi}\right)^{M_j}} \qquad (5.21)$$

where

$C_j$ = junction capacitance/area

$C_{jsw}$ = sidewall capacitance/length

$ad$ = area of drain

$pd$ = perimeter of drain

$V_{db}$ = drain to bulk voltage

$M_j$ = junction grading coefficient

$M_{jsw}$ = sidewall grading coefficient

$\phi$ = bulk potential

We are given $C_{jsw}$ = 0 and hence the first term is gone. Also $C_{db}$ is assumed to be
a linear capacitor and hence the denominator of the second term becomes one.
Consequently, the preceding equation becomes

$$C_{db} = C_j \times ad \qquad (5.22)$$

To calculate $ad$, we refer to the layout of transistor $M_1$, which is shown in Figure 5.5.
Here $L$ is seen to be 1 µm, as given. Also we can see $ad = W \times$ drain_height.
This drain_height is normally dictated by layout constraints and is given as 6 µm.
Substituting the $C_j$ given and drain_height = 6 µm in (5.22) we have

$$C_{db} = 2\ \text{fF}/\mu\text{m}^2 \times W \times 6\ \mu\text{m} \qquad (5.23)$$

Now we are also given

$$f_{rf} = 1.9\ \text{GHz} \qquad (5.24)$$

Substituting (5.18), (5.23), (5.24), and other relevant parameters into (5.15), we
have

$$1 = \frac{3\,(17.8\ \text{V})^2 \sqrt{\left(\frac{1}{50\ \Omega}\right)^2 + \left(2\pi \times 1.9\ \text{GHz} \times W \times 6\ \mu\text{m} \times 2\ \text{fF}/\mu\text{m}^2\right)^2}}{8 \times 100\ \mu\text{A/V}^2 \times \frac{W}{L} \times (4\ \text{V})^3}$$

Solving gives $W$ = 762 µm, that is, $W/L$ = 762 for $L$ = 1 µm.
This is about twice as big as the $M_1$ designed using the low frequency formula
[see (5.20)].

Finally, we should check whether the assumptions leading to the high-frequency
model as described in (5.14) are valid.
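The check suggested above can be sketched in Python. The drain capacitance follows (5.23); the on-resistance uses the usual deep-triode approximation $R_{on} \approx 1/(k'(W/L)(V_{GS} - V_t))$, which is an assumption of this sketch and not a formula taken from this section. The width is the 762 µm quoted in the example, taken at face value.

```python
import math

K_PRIME = 100e-6        # A/V^2, given
V_OV = 4.0              # V_GS - V_t in volts, given
L_UM = 1.0              # channel length in um, given
CJ = 2e-15              # junction capacitance in F/um^2, given
DRAIN_HEIGHT_UM = 6.0   # um, given
F_RF = 1.9e9            # Hz, given

def c_db(w_um):
    """Linear drain-bulk capacitance from (5.23): Cj * W * drain_height."""
    return CJ * w_um * DRAIN_HEIGHT_UM

def r_on(w_um):
    """Assumed deep-triode on-resistance: 1 / (k' (W/L) (V_GS - V_t))."""
    return 1.0 / (K_PRIME * (w_um / L_UM) * V_OV)

def corner_frequency(w_um):
    """1 / (2 pi R_on C), the quantity that (5.14) requires to be much larger than f_rf."""
    return 1.0 / (2.0 * math.pi * r_on(w_um) * c_db(w_um))

w = 762.0  # um, the width quoted in the example
print(f"C_db = {c_db(w) * 1e12:.2f} pF, R_on = {r_on(w):.2f} ohm")
print(f"corner frequency / f_rf = {corner_frequency(w) / F_RF:.2f}")
```

Under this assumed $R_{on}$ model the product $R_{on} C_{db}$ does not depend on $W$ (the resistance scales as $1/W$ and the capacitance as $W$), so the margin in (5.14) is set by the process and bias values alone. With the numbers of this example the corner frequency is only a few times $f_{rf}$, so (5.14) is met loosely rather than comfortably.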


5.4     Conversion  Gain  in  Unbalanced  Switching  Mixer 

We proceed by developing the conversion gain formula when $V_{lo}$ is not switching.
Then we develop the conversion gain formula when $V_{lo}$ is switching.


5.4.1     $V_{lo}$ Not Switching Case

The mixer in Figure 5.1 is modeled such that the channel resistance $R_{on}$ is replaced
by a linear resistor in series with an ideal switch. The nonlinear substrate capacitors
are omitted since they do not have an impact on performance except at very high
frequencies. The final model is shown in Figure 5.6 [1], where $C$ is the equivalent
capacitance of $C_{gs}$ with $C_{gd}$. Notice that this $C$ is different from the $C$ in
sub-section 5.3.3. We first develop the conversion gain by assuming that the switch is on and
treating the mixer as a simple LTI circuit. Hence $G_c$ becomes

$$G_c = \frac{R}{R_{on} \parallel \frac{1}{j\omega_{rf} C} + R} \qquad (5.27)$$

Fig. 5.6 Conversion gain model for the unbalanced switching mixer


5.4.2     Special $V_{lo}$ Switching Case

Now with switching, if the pole and zero frequencies are much higher than $f_{lo}$,
the mixer satisfies (4.150) of Chapter 4. Hence the conversion gain is
modified by the Fourier series coefficient and it becomes

$$G_c = \frac{1}{\pi} \cdot \frac{R}{R_{on} \parallel \frac{1}{j\omega_{rf} C} + R} \qquad (5.28)$$

Taking finite rise and fall time into consideration, expression (5.28) is modified by a
term that involves the factor $(t_r/T)$ [1], where $t_r$ is the rise time and $T$ is the LO
period. The impact is usually small.
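A minimal Python sketch of (5.27) and (5.28) follows. It assumes the reading in which $C$ shunts the switch resistance $R_{on}$, which is the arrangement consistent with the time constants $CR$ (switch open) and $C(R \parallel R_{on})$ (switch closed) quoted in sub-section 5.4.3. The component values are illustrative assumptions, and the function names are hypothetical.

```python
import math

def parallel(z1, z2):
    """Impedance of z1 in parallel with z2 (works for complex values)."""
    return z1 * z2 / (z1 + z2)

def gc_not_switching(f_rf, r, r_on, c):
    """(5.27): divider formed by the load R and R_on shunted by C (assumed topology)."""
    z_c = 1.0 / (1j * 2.0 * math.pi * f_rf * c)
    return r / (parallel(r_on, z_c) + r)

def gc_special_switching(f_rf, r, r_on, c):
    """(5.28): the same divider scaled by the 1/pi Fourier coefficient of the square wave."""
    return gc_not_switching(f_rf, r, r_on, c) / math.pi

# Assumed illustrative values: R = 1 kohm, R_on = 10 ohm, C = 0.1 pF, f_rf = 1.9 GHz.
g = gc_special_switching(1.9e9, 1000.0, 10.0, 0.1e-12)
print(f"|G_c| = {abs(g):.4f} ({20 * math.log10(abs(g)):.2f} dB)")
```

With these values the capacitor impedance is far larger than $R_{on}$, so the result is essentially $(1/\pi)\,R/(R_{on} + R)$, a loss of roughly 10 dB.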


5.4.3     General $V_{lo}$ Switching Case


When $f_{lo}$ increases such that the pole and zero frequencies become comparable to
$f_{lo}$, the effect of the poles and zeros must be considered. This is because the mixer no
longer satisfies (4.150) of Chapter 4. Our goal in this sub-section is to see what happens to
(5.28) under this situation. First we note that the conversion gain of a mixer under
the general $V_{lo}$ switching case has actually been covered in sub-section 4.7.2.1,
Chapter 4. In that section and specifically in (4.135) the formula for $H_n$, the transfer
function of an LPTV system, is given. From (4.135), Chapter 4, it is seen that $H_1$
relates the input at $\omega_{rf}$ to the output at $\omega_{if}$. This, according to the
definition of conversion gain, is exactly $G_c$. We further explain in sub-section 4.7.2.1 that
closed-form solutions of $H_1$ ($G_c$) for a typical LPTV system are hard to determine.
However, the present circuit is simple enough that a closed-form solution can be
obtained. To derive $G_c$ we start by looking at the two phases of the LO
signal and sum the impulse responses. During $\phi_1$, the switch is off (Figure 5.6a)
and the mixer has one impulse response. During $\phi_2$, the switch is closed and
the mixer has another impulse response. The complete conversion gain is then a
weighted sum of the two responses. It is derived in [1] and given as

$$G_c = \frac{1}{\pi}\left(1 - \frac{\tau_2}{\tau_1}\right) \frac{1 + j\pi f_{lo}\left[\tau_2(1 + K) + \tau_1(1 - K)\right]}{(1 + j2\pi f_{lo}\tau_1)(1 + j2\pi f_{lo}\tau_2)} \qquad (5.29)$$

where

$$K = \frac{e^{-1/(2 f_{lo}\tau_1)} - e^{-1/(2 f_{lo}\tau_2)}}{1 - e^{-1/(2 f_{lo}\tau_1)}\, e^{-1/(2 f_{lo}\tau_2)}}$$

and $\tau_1 = CR$, $\tau_2 = C(R \parallel R_{on})$. The terms $\tau_1$, $\tau_2$ correspond to the
two time constants during the two phases, when the switch is open and closed respectively.
For DECT application, $G_c$ as calculated by (5.29) can be substantially less than that
calculated by (5.28). This is particularly significant for passive mixers since $G_c$ is small
to begin with.

The magnitude of $K$ is never greater than one. $|K|$ lies between 0 and 1, and $K$ is
positive when $\tau_1$ exceeds $\tau_2$ and negative when $\tau_1$ is smaller than $\tau_2$.

As a special case, let us start decreasing $f_{lo}$. Then $T$ starts to increase. If $T$ has
increased to such a point that the time constants are much shorter than the period of
the LO signal, then $\tau_1/T$ and $\tau_2/T \to 0$, or $\tau_1 f_{lo}$ and
$\tau_2 f_{lo} \to 0$. Equation (5.29) reduces to

$$G_c = \frac{1}{\pi}\left(1 - \frac{\tau_2}{\tau_1}\right) \qquad (5.30)$$

$$G_c = \frac{1}{\pi} \cdot \frac{R}{R_{on} + R} \qquad (5.31)$$

Decreasing $f_{lo}$ means that the pole and zero frequencies are once again becoming
much higher than $f_{lo}$ and we are reverting to the scenario depicted
in sub-section 5.4.2. We should then get the same $G_c$ formula. Let us revisit the
$G_c$ formula in sub-section 5.4.2, which is given by (5.28). As $f_{lo}$ decreases,
$f_{rf}$ decreases. In (5.28) the $1/(\omega_{rf} C)$ term becomes much larger than the
$R_{on}$ term and can be neglected. Equation (5.28) then becomes

$$G_c = \frac{1}{\pi} \cdot \frac{R}{R_{on} + R}$$

and indeed agrees with (5.31).
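The low-frequency reduction can also be checked numerically. The sketch below evaluates (5.29) under the assumption that it reads $G_c = \frac{1}{\pi}(1 - \tau_2/\tau_1)\,\frac{1 + j\pi f_{lo}[\tau_2(1+K) + \tau_1(1-K)]}{(1 + j2\pi f_{lo}\tau_1)(1 + j2\pi f_{lo}\tau_2)}$ with $K = (a_1 - a_2)/(1 - a_1 a_2)$ and $a_i = e^{-1/(2 f_{lo}\tau_i)}$. This form is chosen to match the limit (5.30) and the stated sign and range of $K$; it should be compared against [1] before being relied on. The component values are illustrative assumptions.

```python
import math

def k_factor(f_lo, tau1, tau2):
    """Assumed form of K: (a1 - a2) / (1 - a1 * a2), with a_i = exp(-1 / (2 f_lo tau_i))."""
    a1 = math.exp(-1.0 / (2.0 * f_lo * tau1))
    a2 = math.exp(-1.0 / (2.0 * f_lo * tau2))
    return (a1 - a2) / (1.0 - a1 * a2)

def gc_general(f_lo, r, r_on, c):
    """Conversion gain of (5.29) for tau1 = C R (switch open), tau2 = C (R || R_on) (closed)."""
    tau1 = c * r
    tau2 = c * (r * r_on / (r + r_on))
    k = k_factor(f_lo, tau1, tau2)
    num = 1.0 + 1j * math.pi * f_lo * (tau2 * (1.0 + k) + tau1 * (1.0 - k))
    den = (1.0 + 2j * math.pi * f_lo * tau1) * (1.0 + 2j * math.pi * f_lo * tau2)
    return (1.0 / math.pi) * (1.0 - tau2 / tau1) * num / den

# Assumed illustrative values: R = 1 kohm, R_on = 10 ohm, C = 1 pF.
R, R_ON, C = 1000.0, 10.0, 1e-12
low = abs(gc_general(1e3, R, R_ON, C))      # time constants far shorter than the LO period
high = abs(gc_general(1.8e9, R, R_ON, C))   # time constants comparable to the LO period
print(f"|G_c| at 1 kHz LO  : {low:.4f}")
print(f"|G_c| at 1.8 GHz LO: {high:.4f}")
print(f"(1/pi) R/(R_on + R): {(1.0 / math.pi) * R / (R_ON + R):.4f}")
```

For these assumed values the gain at a 1.8 GHz LO is several times smaller than the low-frequency limit, which is the kind of reduction the text describes for DECT.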

The  expression  incorporating  finite  rise  and  fall  time  was  not  derived  because  it 
is  too  complicated  and  involves  infinitely  many  phases  and  circuit  configurations. 

As  a  summary  of  Section  5.4  it  should  be  noted  that  Gc  derived  in  sub-section 
5.4.1  through  5.4.3  are  all  less  than  one,  which  is  consistent  with  the  fact  that  the 
mixer  is  passive.  This  is  also  the  essential  difference  from  an  active  mixer,  such  as  a 
Gilbert  mixer,  whose  gain  is  larger  than  one. 


5.5     Noise  in  Unbalanced  Switching  Mixer 

We  proceed  by  developing  the  noise  expression  without  switching.  Then  we 
develop  the  noise  expression  with  switching. 


5.5.1     $V_{lo}$ Not Switching Case

To calculate the noise without switching, we normally assume that the switch is
closed. Then from Figure 5.6b, the equivalent resistance is $R_{on} \parallel R$. Now the output
noise PSD can be calculated from (4.34) of Chapter 2 and is given by

$$S_n = 4kT(R_{on} \parallel R) \qquad (5.32)$$

Input referred noise PSD, $N_{dev}$, is then given by

$$N_{dev} = \frac{S_n}{G_c^2} \qquad (5.33)$$

where $G_c$ is given by (5.27) and $S_n$ is given by (5.32). Finally, we want to
calculate NF. First we repeat (2.67) of Chapter 2:

$$NF = \frac{N_{Device} + G \cdot N_{Source\_resistance}}{G \cdot N_{Source\_resistance}} \qquad (5.34)$$

Fig. 5.7 Noise model for the unbalanced switching mixer

Substituting $N_{dev} = N_{Device}/G$ into (5.34), we have

$$NF = 1 + \frac{N_{dev}}{N_{Source\_resistance}} \qquad (5.35)$$

Substitute the $N_{dev}$ we obtained into (5.35), assuming a source resistance of 50 Ω,
and the NF is determined.
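The steps from (5.32) to (5.35) can be sketched numerically. The sketch assumes that $G_c$ is a voltage gain, so the output PSD is divided by $G_c^2$ when referred to the input, that $G_c$ takes its low-frequency value $R/(R + R_{on})$, and that the source resistance contributes $4kTR_s$. The common factor $4kT$ cancels in the ratio. The resistor values are illustrative assumptions.

```python
import math

def parallel(r1, r2):
    """Resistance of r1 in parallel with r2."""
    return r1 * r2 / (r1 + r2)

def noise_figure_not_switching(r, r_on, r_source=50.0):
    """NF = 1 + N_dev / N_source with N_dev = 4kT (R_on || R) / G_c^2 and N_source = 4kT R_s."""
    g_c = r / (r + r_on)                      # low-frequency voltage gain (assumed)
    n_dev_over_4kt = parallel(r_on, r) / g_c ** 2
    return 1.0 + n_dev_over_4kt / r_source

# Assumed illustrative values: R = 1 kohm, R_on = 10 ohm, 50-ohm source.
nf = noise_figure_not_switching(1000.0, 10.0)
print(f"NF = {nf:.3f} ({10 * math.log10(nf):.2f} dB)")
```

Under these assumptions the input-referred device noise resistance simplifies to $R_{on}(R + R_{on})/R$, so a small $R_{on}$ keeps the noise figure of the non-switching mixer close to one.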


5.5.2     Special $V_{lo}$ Switching Case

First let us follow the discussion in Figure 4.17, Chapter 4 and represent the present
switching mixer in a similar way. The resulting representation is shown in Figure 5.7,
where there is an input noise PSD, denoted as $S_{n1}$, and an output noise PSD,
denoted as $S_{n2}$. Applying (4.134), Chapter 4, to Figure 5.7, we have

$$S_{n2}(\omega_{if}) = \sum_{n=-\infty}^{\infty} |H_n(\omega_{if})|^2\, S_{n1}(\omega_{if} + n\omega_{lo}) \qquad (5.36)$$

In sub-section 4.7.2.2, Chapter 4, we have outlined how to calculate $H_n$ under the
special $V_{lo}$ switching case; that is, when the pole and zero frequencies are much
higher than $f_{lo}$. Readers can apply the same principle to the present case and
calculate $H_n$, and hence $S_{n2}$, in (5.36).


5.5.3     General $V_{lo}$ Switching Case


What about the case when the mixer behaves as a general LPTV system; that is,
when the pole and zero frequencies become comparable to $f_{lo}$?

Usually a closed-form solution for $H_n$ is rather difficult to calculate in this case.
However, the present circuit is simple enough that a closed-form solution can
indeed be obtained. Reference [1] has worked out these $H_n$ expressions, and the
resulting $S_{n2}$ is calculated to be

$$S_{n2} = 4kT\left\{\frac{R_1 + R_2}{2} - f_{lo} C (R_1 - R_2)^2\, \frac{\left(1 - e^{-1/(2 f_{lo}\tau_1)}\right)\left(1 - e^{-1/(2 f_{lo}\tau_2)}\right)}{1 - e^{-1/(2 f_{lo}\tau_1)}\, e^{-1/(2 f_{lo}\tau_2)}}\right\} \qquad (5.37)$$

Here $R_1 = R$ (corresponding to the resistance across the capacitor during phase 1,
when the switch is open) and $R_2 = R_{on} \parallel R$ (corresponding to the resistance across
the capacitor during phase 2, when the switch is closed). $S_{n2}$ can also be expressed in
terms of an equivalent resistance $R_{eq}$ as

$$S_{n2} = 4kTR_{eq} \qquad (5.38)$$

As a special case notice that when $f_{lo} = 0$, (5.37) reduces to

$$S_{n2} = 4kT\,\frac{R_1 + R_2}{2} = 4kT\,\frac{R + (R_{on} \parallel R)}{2} \qquad (5.39)$$

What does $f_{lo} = 0$ mean? It means that the switch is either on or off all the time.
Notice, from (5.32), that the output noise PSD for the case when the switch is on is
$4kT(R_{on} \parallel R)$.

What about the case when the switch is off? If we assume that the switch is off,
then the output noise PSD is given by

$$S_{n2} = 4kTR \qquad (5.40)$$

Hence the $S_{n2}$ for these two cases are $4kT(R_{on} \parallel R)$ and $4kTR$ respectively, and
therefore the total $S_{n2}$ should be the average of the two, which agrees with (5.39).
Again the input-referred noise, $N_{dev}$, is given by

$$N_{dev} = \frac{S_{n2}}{G_c^2} \qquad (5.41)$$

where $G_c$ this time is given by (5.29) and $S_{n2}$ is given by (5.37). NF can now be
calculated by substituting the $N_{dev}$ we obtained into (5.35).

Finally, the expression incorporating finite rise and fall time was not derived as it
is too complicated and involves infinitely many phases and circuit configurations.
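The equivalent resistance of (5.38) can be explored with a small Python sketch. It assumes (5.37) in the form $R_{eq} = \frac{R_1 + R_2}{2} - f_{lo} C (R_1 - R_2)^2 \frac{(1 - a_1)(1 - a_2)}{1 - a_1 a_2}$ with $a_i = e^{-1/(2 f_{lo} R_i C)}$, and the component values are illustrative assumptions. `math.expm1` is used so that the terms $1 - e^{-x}$ stay accurate when $x$ is very small.

```python
import math

def r_eq(f_lo, r, r_on, c):
    """Equivalent noise resistance S_n2 / (4kT) from (5.37); valid for f_lo > 0."""
    r1 = r                          # resistance across C with the switch open
    r2 = r * r_on / (r + r_on)      # resistance across C with the switch closed
    x1 = 1.0 / (2.0 * f_lo * r1 * c)
    x2 = 1.0 / (2.0 * f_lo * r2 * c)
    # -expm1(-x) equals 1 - exp(-x) without cancellation error for small x.
    shape = (-math.expm1(-x1)) * (-math.expm1(-x2)) / (-math.expm1(-(x1 + x2)))
    return 0.5 * (r1 + r2) - f_lo * c * (r1 - r2) ** 2 * shape

# Assumed illustrative values: R = 1 kohm, R_on = 10 ohm, C = 1 pF.
R, R_ON, C = 1000.0, 10.0, 1e-12
for f in (1.0, 1.8e9, 1e15):
    print(f"f_lo = {f:9.3g} Hz -> R_eq = {r_eq(f, R, R_ON, C):8.3f} ohm")
```

At very low $f_{lo}$ the result approaches the plain average of (5.39). At very high $f_{lo}$ the same expression tends to $2R_1R_2/(R_1 + R_2)$, the resistance corresponding to the time-averaged conductance of the two phases.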


5.6    A Practical Unbalanced Switching Mixer

A practical realization of the unbalanced switching mixer is shown in Figure 5.8.
This mixer is shown together with an IF amplifier ($M_2$) in the figure. Transistor $M_1$
is the basic switch. The incoming RF signal is chopped by the switching action of
the LO signal, turning $M_1$ on and off. The IF signal is produced by $M_1$ and amplified
by $M_2$. When $M_1$ is on, it behaves like a small resistor (less than 10 Ω). Transistor
$M_3$ sets the bias voltage. For a $V_{dd}$ of 5 V, $V_{D3}$ is around 1.3 V. $M_4$ and $M_5$ are
biased in the triode region and hence behave as large resistors (since their $W/L$ are very
small).

Fig. 5.8 Switching mixer and IF amplifier

Now let us see how the circuit operates. First let us take a look at $V_{GS1}$. We look at
$V_{G1}$ first. Assume first that the voltage at the LO port is zero and therefore does not
provide any DC current; then $I_{D4} = I_R = I_{G1} = 0$. Hence $V_{G1} = V_{D4} = V_{D3}$ = 1.3 V.
Next let us look at $V_{S1}$. Assume that the RF port does not provide any DC current.
Then $I_{D5} = 0$ and so $V_{S1} = V_{S5} = V_{D3}$ = 1.3 V, again. Therefore, $V_{GS1}$ = 0 V and
$M_1$ is nominally off.

Second, let us assume that the voltage at the LO port goes up. $M_1$ is turned on as
soon as $V_{lo}$ goes up and becomes at least one $V_t$ above 1.3 V. Notice the full driving
capability of $V_{lo}$ is not used (only the $V_{lo} - V_t$ portion is used).

Thirdly, let us now ask why we need $M_4$, $M_5$, $C_1$, $C_2$, $C_{ext}$. They are necessary for
a couple of reasons. To examine these reasons let us replace $M_4$, $M_5$, $C_1$, $C_2$ by their
equivalent impedance operating at a frequency of 1.9 GHz (using DECT as an
example again). The resulting circuit is shown in Figure 5.9.

Fig. 5.9 Switching mixer and IF amplifier redrawn with $M_3$, $M_4$, $M_5$, $C_1$, $C_2$ replaced
by their equivalent impedance

Now we can give the reasons as follows:

1. The LO port cannot be connected directly to node 1. It has to be coupled through
$M_4$. Imagine if the LO port (gate of $M_1$) is connected directly to node 1. Then the
impedance seen by $V_{lo}$ is practically dominated by $C_1$, $C_2$. At 1.9 GHz this is
about 2 Ω, and 50 Ω matching is not achieved. To overcome this, $V_{lo}$ is
connected through $R_1$, $M_4$ to node 1. As shown in Figure 5.9 the resistance at
the LO port is now practically dominated by $R_1$, or 50 Ω. Hence a 50 Ω match is
achieved.

2. The network $C_1$, $R_1$, $M_4$, $C_2$ is there to help attenuate the LO to RF feedthrough.
Going from the LO port to the RF port, we can see that the network consists of two
cascaded RC sections. Neglecting resistances $1/g_{mM3}$, $R_2$, $R_{M5}$ (which are assumed
large compared to the impedance of $C_2$), the cascaded network consists of
$C_2 R_{M4}$ driving $C_1 R_1$. To simplify the analysis, let us ignore the loading effect
between these two RC sections. Hence the two poles of this network can be
approximately given as $p_1 = \frac{1}{2\pi R_{M4} C_2}$ = 159 MHz and
$p_2 = \frac{1}{2\pi R_1 C_1}$ = 1.5 MHz.
These poles are at sufficiently low frequencies that they will provide significant
attenuation for the LO signal switching at close to 1.9 GHz. Notice that since the
amplitude of the LO signal is large and that of the RF signal is small, it is only
necessary to provide filtering in one direction (from the LO port to the RF port).

3. $C_{ext}$ is used to improve RF to IF isolation. Referring to Figure 5.8 notice that
when $M_1$ is on, there is an RC filter from the RF port to the IF port. Here $R$ is the
$R_{on}$ of switching transistor $M_1$ and $C$ consists of $C_{ext}$ in parallel with $C_{db1}$ and
$C_{gs2}$. This RC filter helps filter out the RF component. Obviously, the larger $C_{ext}$
is, the more effective the filtering is. So what limits the value of $C_{ext}$?
It turns out that distortion leads to a ceiling. Referring to (5.15), we can get
the $IM_3$ for the present unbalanced mixer by setting $R$ in that equation to infinity.
The resulting expression is

$$IM_3 = \frac{3 A_{interference}^2\, \omega_{rf} C}{8k(V_{GS} - V_t)^3}$$

Here $C$ consists of $C_{ext}$ in parallel with $C_{db1}$ and $C_{gs2}$, and $V_{GS} = V_{GS1}$.
Notice $IM_3$ increases with increasing $C_{ext}$.
Hence for a given $IM_3$ (from specifications) the maximum $C_{ext}$ is set.
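To put a rough number on the "significant attenuation" of reason 2, the following Python sketch adds the attenuation of two independent single-pole RC sections at the LO frequency. It uses the pole frequencies quoted above (159 MHz and 1.5 MHz) and, like the text, ignores loading between the sections, so the result is only an estimate. The function name is hypothetical.

```python
import math

def single_pole_attenuation_db(f_hz, pole_hz):
    """Attenuation in dB of a first-order lowpass with the given pole, evaluated at f_hz."""
    return 10.0 * math.log10(1.0 + (f_hz / pole_hz) ** 2)

poles_hz = [159e6, 1.5e6]   # pole frequencies quoted in the text
f_lo = 1.9e9                # LO frequency used in the DECT example

per_section = [single_pole_attenuation_db(f_lo, p) for p in poles_hz]
total_db = sum(per_section)
for p, a in zip(poles_hz, per_section):
    print(f"pole at {p / 1e6:6.1f} MHz -> {a:5.1f} dB")
print(f"total LO-to-RF attenuation estimate: {total_db:.1f} dB")
```

Under these simplifications the two sections together give on the order of 80 dB of attenuation at 1.9 GHz, with most of it coming from the 1.5 MHz pole.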


5.7     Sampling  Mixer 


We have previously investigated the switching mixer and discussed it in an application
as a mixer preceding an IF amplifier. For this mixer the output is processed in the
continuous time domain. An alternative structure consists of terminating the output
with a capacitor only (and no resistor). In addition, the mixer output is only looked
at on the falling edge of the $V_{lo}$ signal. This is called a sampling mixer.

Sampling mixers are useful for applications where the circuits that they feed into
consist of sampled data circuits. These may include A/D converters and IF
amplifiers implemented in the sampled data domain (e.g., IF amplifiers implemented
using a switched capacitor network). Some of these circuits will be discussed in
Chapter 6 on A/D converters. In addition, sampling mixers allow mixing to
operate in a special mode, called subsampling, which can lead to simplified
design. The subsampled sampling mixer is discussed later in this chapter.

To characterize the sampling mixer we could have taken the formulae derived
for distortion, noise and conversion gain in Sections 5.3 through 5.5 for switching
mixers, incorporated the sampling effect (which involves aliasing of distortion
and noise components), and redeveloped all the formulas. For some characteristics
(like conversion gain) this is exactly what we do. However, for some other
characteristics (like distortion and noise) this can become rather cumbersome and
we decide not to do that. Instead, since the sampling mixer operates in the sampled
data  domain,  we  choose  to  develop  an  understanding  of  such  characteristics  of  the 
circuit  directly  in  the  time  domain.  This  seems  natural,  as  we  should  remember  that 
one  of  the  original  reasons  for  performing  a  frequency  domain-based  analysis  on 
the  switching  mixer  stems  from  the  fact  that  a  switching  mixer  operates  in  the 
continuous  time  domain.  Another  reason  for  the  change  in  the  domain  of  analysis 
for  some  characteristics  stems  from  the  fact  that  as  we  are  only  interested  in  the 
output  value  at  the  sampling  instant,  the  behavior  of  the  sampling  mixer  during  the 
falling  edge  of  the  LO  waveform  becomes  critical.  The  behavior  along  the  falling 
edge  can  affect  the  circuit  more  significantly  than  in  the  case  of  a  switching  mixer. 
It  turns  out  that  the  value  of  distortion  and  noise  due  to  finite  fall  time  alone  can 
become  totally  dominant,  and  hence  we  cannot  afford  to  neglect  these  effects.  Since 
the  effect  of  finite  fall  time  is  more  naturally  handled  in  time  domain,  this  change  in 
the  domain  of  analysis  is  fully  justified. 


5.7.1     Qualitative  Description 

Fig. 5.10 Single-ended sampling mixer and sampling operation

Fig. 5.11 Single-ended sampling mixer and sampling operation: only envelope of input shown

As a start, let us look at Figure 5.10, which shows a single-ended sampling mixer.
This is similar to the unbalanced switching mixer. Analogous to the single and
double balanced switching mixers, there are differential and cross-coupled
differential sampling mixers [3]. Referring again to Figure 5.10, notice that the input
$V_{rf}$ consists of a baseband signal superimposed on a carrier. Initially (during the track
mode when $V_{lo}$ is high), $V_{if}$ (dotted line) tries to follow $V_{rf}$. At the end of the track
mode, $V_{lo}$ drops to 0, $M_1$ is switched off and the voltage $V_{if}$ on the capacitor
$C_{load}$ is held (solid horizontal line). This stays until $V_{lo}$ goes high again, when we
enter the next track mode and the operation repeats itself. Let us now look at $V_{if}$ at the
sampling instant only [$V_{if}$ (sampled), solid dots] and record those values. Then this
$V_{if}$ (sampled) is seen to exhibit a much lower frequency than the carrier and
effectively a frequency translation (mixing) has occurred. On closer examination
it can be shown that this $V_{if}$ (sampled) is exactly the baseband signal of the original
$V_{rf}$ and therefore $V_{if}$ (sampled) is essentially the IF signal we want.
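The claim that the sampled values trace out the baseband signal can be illustrated with an idealized Python sketch. It assumes an amplitude-modulated carrier, an ideal track-and-hold (no $R_{on}$, no finite fall time), and an LO whose sampling instants fall exactly on the carrier peaks; the carrier frequency, baseband tone, and modulation depth are illustrative assumptions.

```python
import math

F_CARRIER = 1.9e9    # Hz, assumed carrier frequency
F_BASEBAND = 1.0e6   # Hz, assumed baseband (envelope) tone
DEPTH = 0.5          # assumed modulation depth

def envelope(t):
    """Baseband signal carried by the envelope of V_rf."""
    return 1.0 + DEPTH * math.cos(2.0 * math.pi * F_BASEBAND * t)

def v_rf(t):
    """Baseband signal superimposed on the carrier."""
    return envelope(t) * math.cos(2.0 * math.pi * F_CARRIER * t)

def sampled_if(n_samples, f_lo=F_CARRIER):
    """Ideal sampling mixer: V_if(sampled) is V_rf held at the instants t_k = k / f_lo."""
    return [v_rf(k / f_lo) for k in range(n_samples)]

samples = sampled_if(2000)
worst = max(abs(s - envelope(k / F_CARRIER)) for k, s in enumerate(samples))
print(f"largest deviation from the envelope over 2000 samples: {worst:.2e}")
```

Because each sample lands at the same carrier phase, the carrier term is constant from sample to sample and only the slowly varying envelope remains.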

Ideally the envelope (the baseband signal), rather than the carrier of $V_{rf}$, contains
all the essential information, and hence let us for the time being neglect the carrier
and concentrate on the envelope. We redraw Figure 5.10 in
Figure 5.11 with this simplification. First, the negative peaks of $V_{rf}$ in Figure 5.10
are extrapolated and used to construct an envelope of $V_{rf}$. Next, to save inventing
new symbols, this envelope is also denoted as $V_{rf}$ from now on. This $V_{rf}$ is shown in
Figure 5.11, and is seen to be slowly varying. If this slowly varying $V_{rf}$ is
applied to the input of the sampling mixer, during the track mode $V_{if}$ follows $V_{rf}$
closely. During the hold mode the solid dots show the $V_{if}$ (sampled).

Because $V_{rf}$ is varying slowly, within one LO period $V_{rf}$ is almost constant. Thus
to analyze this circuit we can treat the transistor $M_1$ as a simple on-off switch with
an almost constant input voltage. It passes charge to the sampling capacitor when in
the track mode, and preserves the charge when in the hold mode.

Fig. 5.12 Single-ended sampling mixer and sampling operation: nonideal effects highlighted


5.7.2    Nonidealities

Under ideal conditions $V_{if}$ (sampled) should equal $V_{rf}$ at the sampling instant and the
mixing operation recovers the baseband signal perfectly. There are, however, a few
nonidealities that make this perfect recovery impossible. Figure 5.11 is redrawn in
Figure 5.12 to illustrate this.


5.7.2.1     Finite  Bandwidth 


As seen in Figure 5.12, during tracking $V_{if}$ follows $V_{rf}$ closely, but not perfectly.
This is because of the non-zero drain-source resistance $R_{on}$ of the MOS device.
The effect of $R_{on}$ is to create an RC lowpass filter with capacitance equal to $C_{load}$ in
parallel with $C_{db}$. This introduces a delay. Hence at the sampling instant $V_{if}$
(sampled) is not the same as $V_{rf}$ at that instant and an error is introduced. This
effect is similar to the finite bandwidth effect of switching mixers described in
sub-section 5.2.3.3, except that now we are interested in the error at the sampling instant
only. Again, this error leads to a reduction in conversion gain.


5.7.2.2     Nonlinearity

During the track mode, the nonlinear $R_{on}$ introduces distortion in much the same way
as described in sub-section 5.2.3.1. As a result, the $IM_3$ can be predicted by developing
similar equations. For example, (5.15) can be modified to predict the $IM_3$ of a sampling
mixer at high frequency by setting $R = \infty$ and $C = C_{db} + C_{load}$ in that equation.

It can be seen in Figure 5.12 that, as opposed to the ideal case, $V_{lo}$ has a finite fall
time during the sampling instant. As the switch moves from track mode to hold
mode during this finite fall time, it will introduce extra distortions. As opposed to
switching mixers (which also suffer from extra distortions due to finite fall time),
in sampling mixers these extra distortions can dominate the total distortion. This is
because these extra distortions occur around the sampling instant, a moment most critical
to sampling mixers. These extra distortions are discussed in more detail later.


5.7.2.3     Feedthrough 

This nonideality is similar to the feedthrough described in sub-section 5.2.3.2,
except that the critical feedthrough is the one that happens during the sampling
instant. Hence the feedthrough from LO at the sampling instant is most important.
At that instant feedthrough from the LO is due to the existence of the capacitive
divider, which consists of the gate to drain capacitance $C_{gd}$ and $C_{load}$. On the falling
edge of $V_{lo}$ the change in $V_{lo}$ (assumed going from $V_{dd}$ to 0) results in a corresponding
change of $V_{if}$ given by

$$V_{feedthrough} = \frac{C_{gd}}{C_{load} + C_{gd}}\, V_{dd} \qquad (5.42)$$

This error is one of the factors that contribute to the droop of $V_{if}$ at the sampling
instant. The droop is evident in the $V_{if}$ waveform around the sampling instant in
Figure 5.12. Now since this error is fixed, the resulting error is a DC offset. For
differential and cross-coupled differential mixers, this is not an issue.


5.7.2.4     Charge  Injection 

This nonideality is unique to sampling mixers. The concept is borrowed from
sampled data circuits where the circuit designers are concerned with the effect of
the channel charge dumping upon the turn-off of the switch. This error charge is
dumped onto the storage capacitor, $C_{load}$, and adds an offset voltage from the actual
$V_{rf}$. The amount of charge stored in the channel is a function of $V_{rf}$ (the magnitude
varies with the frequency). When the switch moves from an on to an off state with a
slow $V_{lo}$ edge transition, roughly half of the charge is transferred to the load. This
results in an error in $V_{if}$ [and $V_{if}$ (sampled)] equal to

$$\frac{Q_{chan}}{2C_{load}} = \frac{W_1 L_1 C_{ox}(V_{dd} - V_{rf} - V_t)}{2C_{load}} \qquad (18)$$

Fig. 5.13 Bottom-plate single-ended sampling mixer

Here $Q_{chan}$ is the channel charge. This error is the other factor that contributes to
the droop of $V_{if}$ at the sampling instant. This error is dependent on $V_{rf}$, so this error
is translated into signal-dependent distortion. Differential and cross-coupled
differential mixers still suffer from this distortion.
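The difference between the two droop mechanisms can be seen in a short Python sketch that evaluates the feedthrough error of (5.42) next to the charge-injection error, taken here as $W_1 L_1 C_{ox}(V_{dd} - V_{rf} - V_t)/(2C_{load})$ per the half-charge statement above. All device values below (geometry, $C_{ox}$, $C_{gd}$, $C_{load}$, $V_t$) are illustrative assumptions, not values from the text.

```python
def feedthrough_error(c_gd, c_load, v_dd):
    """(5.42): LO step of v_dd divided down by the C_gd / C_load capacitive divider."""
    return c_gd / (c_load + c_gd) * v_dd

def charge_injection_error(w_um, l_um, c_ox, v_dd, v_rf, v_t, c_load):
    """Half of the channel charge W L C_ox (V_dd - V_rf - V_t) dumped onto C_load."""
    q_chan = w_um * l_um * c_ox * (v_dd - v_rf - v_t)
    return q_chan / (2.0 * c_load)

# Assumed illustrative values.
W_UM, L_UM = 100.0, 1.0    # switch dimensions in um
C_OX = 2e-15               # gate oxide capacitance in F/um^2
C_GD, C_LOAD = 20e-15, 1e-12
V_DD, V_T = 5.0, 1.0

print(f"feedthrough error: {feedthrough_error(C_GD, C_LOAD, V_DD) * 1e3:.1f} mV (same for every sample)")
for v_rf in (0.0, 0.5, 1.0):
    err = charge_injection_error(W_UM, L_UM, C_OX, V_DD, v_rf, V_T, C_LOAD)
    print(f"V_rf = {v_rf:.1f} V -> charge-injection error {err * 1e3:.0f} mV")
```

The feedthrough term does not depend on $V_{rf}$, so it appears as a DC offset, while the charge-injection term changes with the input level, which is why it shows up as distortion.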


5.7.2.5     Jitter 

Notice  in  Figure  5.12  that  during  the  turn-off  process,  in  addition  to  having  a  finite 
slope,  the  clock  edge  is  uncertain.  This  uncertainty  in  clock  edge  is  denoted  as  jitter 
in  Figure  5.12.  The  uncertainty  causes  the  exact  instant  of  storing  the  Vrf  to  have  an 
uncertainty  as  well.  This  uncertainty  manifests  itself  as  noise  in  the  sampling  mixer. 
This  is  an  aspect  unique  to  a  sampling  mixer. 
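A common first-order way to size this effect, not derived in this section, is to multiply the timing uncertainty by the slope of the sampled waveform. For a sinusoid of frequency $f$ and an rms jitter $\sigma_t$ that is small compared with the period, this standard approximation gives a jitter-limited SNR of $-20\log_{10}(2\pi f \sigma_t)$. The Python sketch below applies it; the 1 ps jitter value is an illustrative assumption.

```python
import math

def jitter_limited_snr_db(f_signal_hz, sigma_t_s):
    """Small-jitter approximation: SNR = -20 log10(2 pi f sigma_t) for a sampled sinusoid."""
    return -20.0 * math.log10(2.0 * math.pi * f_signal_hz * sigma_t_s)

# Assumed illustrative values: 1.9 GHz carrier sampled with 1 ps rms clock jitter.
snr = jitter_limited_snr_db(1.9e9, 1e-12)
print(f"jitter-limited SNR: {snr:.1f} dB")
```

The relevant frequency is that of the signal actually present at the switch input, so sampling an RF carrier directly is far more sensitive to clock jitter than sampling a low-frequency signal.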


5.7.3    Bottom-Plate  Sampling  Mixer 


To combat charge injection error, an improved sampling mixer topology is
shown in Figure 5.13. In this circuit a technique known as bottom-plate sampling
is employed to remove the signal dependence in the error term.

The input is $V_{rf}$ and $V_{if}$ is taken as the voltage across $C_{load}$. The LO signal $V_{lo}$ is
applied to switch $M_1$ and a signal $V_{lo\_bottom}$ is applied to switch $M_2$. Both switches
are implemented using MOS transistors. Notice both $V_{lo}$ and $V_{lo\_bottom}$ are the same,
except that $V_{lo\_bottom}$ has its falling edge slightly advanced compared to that of $V_{lo}$.
During tracking, both $M_1$ and $M_2$ are on, so $V_{rf}$ is sampled onto $C_{load}$. When the
mixer goes from the track mode to the hold mode, because of the slightly advanced
falling edge of $V_{lo\_bottom}$, $M_2$ is opened first, which effectively freezes the amount of
charge stored on $C_{load}$. Then the top switch $M_1$ opens, which fixes the voltage level
across $C_{load}$. Since $M_2$ is opened slightly before $M_1$, the $M_2$ switch determines the
sampling instant. Since both the drain and source of transistor $M_2$ are at ground, no
signal-dependent charge injection is introduced. When $M_1$ is subsequently turned
off, the charge in the channel sees an open circuit on the side of $C_{load}$ and therefore
all the signal-dependent charge that is injected must flow to the other side of the
transistor or to the input source (and again no error is introduced). Thus, the droop
can be made constant. Similar to the feedthrough case, we can now employ
differential techniques, and this constant offset simply becomes a harmless
common-mode shift. Note this technique introduces yet another transistor $M_2$, whose
nonlinear $R_{on}$ has to be considered. However, as noted above if $V_{rf}$ can be assumed
to be almost constant (see Figure 5.11), then the voltage at the bottom plate of $C_{load}$
is almost constant and distortion introduced by $R_{on}$ of $M_2$ is negligible.

Fig. 5.14 Subsampling operation


5.7.4    Sub-Sampled  Mixer 


Finally, there is a special type of sampling mixer called a subsampling mixer, shown
in Figure 5.14. To illustrate subsampling, the complete Vrf (baseband signal
superimposed on a carrier) is shown.

Structurally this mixer is the same as the one in Figure 5.10. The only
difference is that the LO frequency is an integer submultiple of the normal LO
frequency. In Figure 5.14, there are six Vrf cycles in one LO period, so the
subsampling factor is 6. By maintaining this integer ratio, the
sampling instant always occurs at the same phase of Vrf. Hence in Figure 5.14, sampling
always occurs at the peaks of Vrf, as illustrated by Vif (sampled), the solid dots.
The peaks of Vrf are again extrapolated to construct an envelope of Vrf. It is
illustrative to compare Figure 5.14 with Figure 5.10. In both cases Vif (sampled)
is seen to capture the envelope of Vrf.

As a real-life example, if the RF is 1.9 GHz, to get a 100 MHz IF, the LO
frequency should be 1.8 GHz. However, this is also achievable by using an LO of
180 MHz. Essentially one can view this as using the tenth harmonic of the 180
MHz LO signal (at 1.8 GHz) to perform the sampling.
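
The arithmetic of this example can be sketched in a few lines of Python. The function name harmonic_if and its arguments are illustrative assumptions, not notation from the text; the sketch simply computes the IF obtained when the n-th harmonic of the LO performs the downconversion.

```python
def harmonic_if(f_rf_hz: float, f_lo_hz: float, harmonic: int) -> float:
    """IF produced when the given harmonic of the LO mixes with the RF.

    Hypothetical helper for illustration: |f_rf - harmonic * f_lo|.
    """
    return abs(f_rf_hz - harmonic * f_lo_hz)


# Conventional mixer: 1.8 GHz LO, fundamental (harmonic = 1).
if_conventional = harmonic_if(1.9e9, 1.8e9, 1)

# Subsampling mixer: 180 MHz LO, tenth harmonic at 1.8 GHz.
if_subsampled = harmonic_if(1.9e9, 180e6, 10)

# Both values are printed in MHz.
print(if_conventional / 1e6, if_subsampled / 1e6)
```

Both calls land on the same IF, which is the sense in which the 180 MHz LO can stand in for the 1.8 GHz LO.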

There  is  a  limit  on  how  low  the  subsampling  frequency  (or  how  large  the 
subsampling  factor)  can  go.  Essentially  the  lowest  LO  frequency  has  to  be  larger 
than  two  times  the  bandwidth  of  the  baseband  signal.  For  example,  for  DECT,  since 
the  channel  bandwidth  is  1.7  MHz,  the  lowest  LO  signal  has  to  be  larger  than  3.4 
MHz.  This  lower  limit  is  set  by  the  so-called  bandpass  sampling  theorem  -  a  variant 
of  the  Nyquist  sampling  theorem  applied  to  a  bandpass  signal. 
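
A minimal sketch of this bound follows, assuming the simple rule stated above (LO frequency strictly larger than twice the bandwidth). The helper names are hypothetical, and the 1.8 GHz "normal" LO is borrowed from the example in this section purely for illustration.

```python
import math


def min_lo_frequency(bandwidth_hz: float) -> float:
    """Lower bound on the LO (sampling) frequency: twice the signal bandwidth."""
    return 2.0 * bandwidth_hz


def max_subsampling_factor(f_lo_normal_hz: float, bandwidth_hz: float) -> int:
    """Largest integer N for which f_lo_normal / N stays above the bound."""
    bound = min_lo_frequency(bandwidth_hz)
    n = math.floor(f_lo_normal_hz / bound)
    # The bound is strict ("larger than"), so back off if N lands exactly on it.
    if f_lo_normal_hz / n <= bound:
        n -= 1
    return n


dect_bandwidth_hz = 1.7e6
print(min_lo_frequency(dect_bandwidth_hz))               # bound in Hz
print(max_subsampling_factor(1.8e9, dect_bandwidth_hz))  # largest integer factor
```

In practice the usable factor is far smaller than this ceiling, because the noise aliasing discussed next grows with the factor.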

The main advantage of this type of mixer is that the settling requirement on the
mixer output voltage is relaxed substantially. Using DECT as the example
again, instead of having to settle in a period given by 1/(2 x 1.8 GHz), the output can settle
in a period of 1/(2 x 0.18 GHz). This relaxes the requirement on the frequency
synthesizer that drives the mixer. The disadvantage is that all the noise at the
RF port, centered around multiples of 0.18 GHz, is now aliased into the IF port. Since
there are 10 times more images, the aliased noise power increases accordingly.
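
To put a rough number on this penalty, the sketch below assumes that each aliased band contributes equal, uncorrelated noise power, so the total scales with the number of images. That assumption and the function name are illustrative only; they are not taken from the text.

```python
import math


def aliased_noise_increase_db(subsampling_factor: int) -> float:
    """Increase in aliased noise power, in dB, relative to no subsampling.

    Assumes (for illustration only) equal, uncorrelated noise in every band.
    """
    return 10.0 * math.log10(subsampling_factor)


# Subsampling factor of 10, as in the 180 MHz LO example.
print(aliased_noise_increase_db(10))
```

Under this assumption a subsampling factor of 10 costs about 10 dB of noise, which is the price paid for the relaxed LO frequency.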

A  bottom-plate  version  of  this  subsampled  version  can  also  be  implemented 
rather  easily. 

Next, as with switching mixers, we develop design equations that address some
of these issues. In the process we quantify the three relevant parameters: conversion
gain (Gc), distortion (IIP3), and noise (NF). Furthermore, all cases are covered for
the scenarios when Vlo is and is not switching. In addition, for noise and distortion
analysis, in all the cases when Vlo is switching, the particular case when Vlo is
switching with a finite rise/fall time is given special treatment. In the sampling
mixer, this is necessary and important since it is precisely at the falling edge of Vlo
that the output value is determined. This makes the noise and distortion analysis
quite different from the switching mixer case. Most of the analysis is focused on the
single-ended sampling mixer.


5.8     Conversion  Gain  in  Single-Ended  Sampling  Mixer 

We  proceed  by  developing  the  conversion  gain  with  Vlo  not  switching.  Then  we 
develop  the  conversion  gain  with  Vlo  switching. 
Fig.  5.15    Conversion  gain  model  for  the  single-ended  sampling  mixer


5.8.1     Vlo  not  Switching  Case


The  model  for  the  single-ended  sampling  mixer  as  described  by  Figure  5.10  is 
developed  in  Figure  5.15. 

In this figure the channel resistance is modeled as a linear resistor Ron in series
with an ideal switch. The nonlinear junction capacitor Cdb is lumped together with
Cload and is denoted as C. We first develop the conversion gain by assuming that the
switch is on. Hence Gc becomes

$$G_c = \frac{\dfrac{1}{j\omega_{rf} C}}{\dfrac{1}{j\omega_{rf} C} + R_{on}} \qquad (5.43)$$
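
As a quick numerical illustration, the sketch below reads (5.43) as the voltage divider formed by Ron and C, that is, Gc = (1/(jωC)) / (1/(jωC) + Ron). The function name and the component values are hypothetical, chosen only to show the order of magnitude.

```python
import math


def conversion_gain_switch_on(f_rf_hz: float, r_on_ohm: float, c_farad: float) -> complex:
    """Gc with the switch held on: capacitor impedance over (capacitor impedance + Ron)."""
    z_c = 1.0 / (1j * 2.0 * math.pi * f_rf_hz * c_farad)
    return z_c / (z_c + r_on_ohm)


# Hypothetical values chosen only for illustration.
gc = conversion_gain_switch_on(f_rf_hz=1.9e9, r_on_ohm=100.0, c_farad=0.1e-12)
print(abs(gc), 20.0 * math.log10(abs(gc)))  # magnitude and gain in dB
```

With these values the RC corner lies well above the RF, so the gain stays close to unity, as expected for a tracking switch.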


5.8.2     Special  Vlo  Switching  Case

As with the switching mixer case, if the pole/zero frequencies are much higher
than $f_{lo}$, the sampling mixer satisfies (4.150), Chapter 4. The conversion gain is
modified by the Fourier series coefficient and becomes

$$G_c = \frac{1}{\pi}\cdot\frac{\dfrac{1}{j\omega_{rf} C}}{\dfrac{1}{j\omega_{rf} C} + R_{on}} \qquad (5.44)$$


Taking  finite  rise  and  fall  time  (tr,  tf)  into  consideration,  expression  (5.44)  is 
modified  by  (t,/T)  [1]  and  the  impact  is  usually  small. 

5.8.3     General  Vlo  Switching  Case


When the pole/zero frequencies become comparable to $f_{lo}$, the conversion gain is
found in a manner similar to the conversion gain case for the switching mixer, except
that we are interested in the sampled (rather than continuous) output. Basically we start
by looking at the two phases of the LO signal and sum the responses of a tracking
system and a holding system. During tracking, we are doing natural sampling of a
signal, and the response is found by multiplying Vrf by the switching function.
During the hold mode, the switch is turned off and the output is held. The response
during this phase is found by applying the impulse samples, weighted by the
sampled values, to the RC filter (an LTI system). The final expression [1] is


Gc((Djf,   (Oio) 


Gco(a>jf) 


(Olf        \ 


exp 


Gro(coff) 


ojjfC 


ajjfC 


Rn 


(5.45) 


Here $\omega_{if}$ is the IF of the mixer and is related to $\omega_{rf}$, for a downconversion mixer,
by $\omega_{if} = \omega_{rf} - \omega_{lo}$. Notice that, compared with (5.43) and (5.44), Gc here also
depends on $\omega_{lo}$. The expression incorporating finite rise and fall time was not
derived, as it is too complicated and involves an infinite number of phases.


5.9     Distortion  in  Single-Ended  Sampling  Mixer 

We assume that distortion due to charge injection is effectively eliminated by
bottom-plate sampling. We analyze the distortion of this bottom-plate sampling
mixer by first analyzing the unbalanced simple mixer. To do this we redraw
Figure 5.12 in Figure 5.16, where the finite rise/fall time is highlighted. The fall
time is labeled Tf. Now let us concentrate on the causes of distortion. Sources
include the following:

a.  Traditional nonlinear resistance, Ron, of switches (see Section 5.3).

b.  Distortion due to finite fall time, which, though briefly mentioned in Section 5.3,
has become significant in the present case and deserves special attention. This is
because, as a result of the sampling operation, the output value is determined
precisely at the falling edge of the LO waveform.

Fig.  5.16    Distortion  model  for  single-ended  sampling  mixer  (high  frequency,  finite  fall  time)

We concentrate on high-frequency analysis of the sampling mixer, as this is the
frequency range of interest. (Low-frequency analysis can be performed easily by
repeating this high-frequency analysis.) As mentioned at the beginning of Section
5.7, it is best to perform the analysis in the time domain, rather than in the frequency
domain. Hence we apply the Volterra series calculation, this time in the time domain.


5.9.1     High  Frequency  (Vlo  not  Switching)  Case

To familiarize the reader with Volterra series in the time domain, we first
treat the simple case where the sampling mixer is operated with Vlo on (and
constant) all the time. We show that distortion formulas derived under this
operating condition agree with the distortion formulas derived in sub-section
5.3.3, where Volterra series in the frequency domain is used. This is not surprising
because under this operating condition (Vlo on and constant) sampling does not
occur, and hence the sampling mixer operates in exactly the same way as the
switching mixer (with Vlo on and constant as well). We further note that the
sampling mixer is described by a time-invariant circuit and denote distortion
generated under this condition as continuous time distortion.

Let us begin the analysis by repeating the assumptions we made in sub-section
5.3.1. We again assume that the distortion is mainly determined by the nonlinear
Ron of the MOS transistor and that distortion due to bias-dependent junction
capacitance is quite small for typical values of MOS model parameters. We also
choose to ignore body effect for simplicity in analysis. Therefore, in deriving the
Volterra series coefficient, we assume that in Figure 5.16 the transistor operating in
the triode region is represented by the most basic MOS equation:

$$I_d = k(V_{gs} - V_t)V_{ds} - \frac{k}{2}V_{ds}^2 \qquad (5.46)$$

where $k = \mu C_{ox} W/L$, and $V_t$ is the threshold voltage. The differential equation that
describes Figure 5.16 thus becomes

$$g V_{if} + C\frac{dV_{if}}{dt} = g V_{rf} - \frac{k}{2}\left(V_{rf}^2 - V_{if}^2\right) \qquad (5.47)$$

where g is the conductance of M1 and is defined as $g = k(V_G - V_t)$.
Next we express $V_{if}(t)$ in its Volterra series in the time domain:

$$V_{if}(t) = v_1(t) + v_2(t) + v_3(t) + \cdots \qquad (5.48)$$

where $v_n(t) = H_n[V_{rf}(t)]$. Here $H_n$ are the Volterra coefficients and should not be
confused with the conversion gain. Now we substitute this expansion into (5.47),
keeping only the first three orders, and we get

$$g(v_1 + v_2 + v_3) + C\frac{d(v_1 + v_2 + v_3)}{dt} = g V_{rf} - \frac{k}{2}\left[V_{rf}^2 - v_1^2 - 2 v_1 v_2\right] \qquad (5.49)$$

Note that Vrf is first order since the input is a pure sinusoid; the term $v_1^2$ is
second order; and $v_1 v_2$ is third order. Now, equating terms of the same order,

$$\begin{aligned} g v_1 + C\frac{dv_1}{dt} &= g V_{rf} \\ g v_2 + C\frac{dv_2}{dt} &= \frac{k}{2}\left(v_1^2 - V_{rf}^2\right) \\ g v_3 + C\frac{dv_3}{dt} &= k v_1 v_2 \end{aligned} \qquad (5.50)$$


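The solutions to this set of differential equations are the Volterra coefficients H1,
H2, and H3. Since we assume a constant gate voltage, g is constant, and
the sampling mixer can be described as a time-invariant circuit. Hence we can apply
time-invariant Volterra series theory to the preceding equations and derive the
Volterra coefficients as

$$H_1(\omega_1) = \frac{g}{g + j\omega_1 C} \approx 1 - \frac{j\omega_1 C}{g} \qquad (5.51)$$

$$H_2(\omega_1,\omega_2) = \frac{\frac{k}{2}\left(H_1(\omega_1)H_1(\omega_2) - 1\right)}{g + j(\omega_1 + \omega_2)C} \approx -\frac{j(\omega_1 + \omega_2)C}{2k(V_G - V_t)^2} \qquad (5.52)$$

$$H_3(\omega_1,\omega_2,\omega_3) = \frac{\overline{k\,H_1(\omega_1)\,H_2(\omega_2,\omega_3)}}{g + j(\omega_1 + \omega_2 + \omega_3)C} \approx -\frac{j(\omega_1 + \omega_2 + \omega_3)C}{3k(V_G - V_t)^3} \qquad (5.53)$$

where $\omega_1$, $\omega_2$, $\omega_3$ are the frequencies of the input signals. Here the assumption of
a small time constant ($g \gg \omega C$) is used, and the expressions are symmetrized (the
overbar in (5.53) denotes averaging over permutations of the input frequencies).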
From (5.51) through (5.53) we get the harmonic and intermodulation distortion. If
the mixer is operating at $\omega_{rf}$ with input amplitude $A_{rf}$, then, in magnitude, we have

$$HD_2 = \frac{A_{rf}}{2}\cdot\frac{\omega_{rf} C}{k(V_G - V_t)^2} \qquad (5.54)$$

$$HD_3 = \frac{A_{rf}^2}{4}\cdot\frac{\omega_{rf} C}{k(V_G - V_t)^3} \qquad (5.55)$$

$$IM_2 = 0 \qquad (5.56)$$

$$IM_3 = \frac{A_{rf}^2}{4}\cdot\frac{\omega_{rf} C}{k(V_G - V_t)^3} \qquad (5.57)$$

Note that if the R term in (5.15) is eliminated, then (5.57) becomes similar
to (5.15).
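
As a rough numerical illustration, the sketch below evaluates the harmonic distortion magnitudes, reading (5.54) and (5.55) as HD2 = (Arf/2) ωrf C / (k (VG - Vt)^2) and HD3 = (Arf^2/4) ωrf C / (k (VG - Vt)^3). The function name and all component values are hypothetical, chosen only so that g = k(VG - Vt) is much larger than ωrf C, as the derivation assumes.

```python
import math


def hd2_hd3(a_rf, f_rf_hz, c_farad, k, v_g, v_t):
    """Magnitudes of HD2 and HD3 for the always-on (time-invariant) case."""
    w_c = 2.0 * math.pi * f_rf_hz * c_farad  # omega_rf * C, in siemens
    v_ov = v_g - v_t                         # gate overdrive, in volts
    hd2 = (a_rf / 2.0) * w_c / (k * v_ov ** 2)
    hd3 = (a_rf ** 2 / 4.0) * w_c / (k * v_ov ** 3)
    return hd2, hd3


# Hypothetical operating point: 100 mV input, 0.1 pF, k = 5 mA/V^2.
hd2, hd3 = hd2_hd3(a_rf=0.1, f_rf_hz=1.9e9, c_farad=0.1e-12,
                   k=5e-3, v_g=3.0, v_t=0.7)
print(20.0 * math.log10(hd2), 20.0 * math.log10(hd3))  # both in dBc
```

Both terms scale linearly with frequency, and HD3 falls off one power of the overdrive faster than HD2.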


5.9.2    High-Frequency  (Special  Vlo  Switching)  Case;
Vlo  has  Finite  Rise  and  Fall  Time

It must be emphasized that the preceding analysis assumes a constant gate voltage.
This applies to the case of a perfect square wave LO and when the mixer's RC time
constant is much smaller than T. However, if the LO voltage deviates from the ideal
square wave, several new distortion effects start to emerge.

First, the nonlinear Ron, which is a function of VG, now becomes time varying
during the fall time. This introduces time-varying distortion. Second, the precise
instant of sampling depends not only on the gate falling slew rate but also on the
input amplitude and frequency. This leads to a situation where the final sampled
voltage on the sampling capacitor when M1 turns off deviates from the steady-state
voltage on the capacitor and leads to sampling distortion.

The  preceding  effects  can  no  longer  be  predicted  with  time-invariant  Volterra 
series  system  theory.  This  leads  us  to  develop  the  theory  of  time-varying  Volterra 
series  in  the  next  sub-section. 


5.9.2.1     Time-Varying  Distortion 

As  mentioned  in  the  previous  sub-section,  it  is  necessary  to  generalize  Volterra 
series  to  the  time-varying  case  in  order  to  analyze  a  sampling  mixer  with  an 
arbitrary  LO  waveform.  In  this  section,  we  develop  a  time-varying  Volterra  series, 
explore  its  frequency  response,  and  show  that  time-varying  Volterra  series  can  be 
applied  in  the  sampled  data  domain  to  solve  for  time-varying  distortion  exactly. 
Impulse  Response  of  Time-Varying  Systems


The notion of impulse response can be easily generalized to time-varying systems
by the addition of another time variable. Alternately, we can start from an LTV
system, as discussed in sub-section 4.7.2.1, Chapter 4, and add nonlinearity to it.
Just to review, the impulse response of an LTV system is a function of two
variables, $h_1(t,\tau)$, which is defined as the response of the system for an input of
$\delta(t - \tau)$. In this case, the system response for an arbitrary input becomes

$$y(t) = \int_{-\infty}^{\infty} h_1(t,\tau)\,x(\tau)\,d\tau \qquad (5.58)$$

Time variable t is called the observation time, and $\tau$ the launch time, because
$h_1(t,\tau)$ represents the output observed at time t for an impulse launched at time $\tau$.
Under general continuity conditions, a linear time-varying system can be
completely characterized by its two-dimensional impulse response, or its kernel.

Higher-order kernels for nonlinear LTV systems are generalized in a similar
way. For example, a second-order kernel has two launch time variables plus an
observation time variable: $h_2(t,\tau_1,\tau_2)$ is the system response to two impulses
launched at time instants $\tau_1$ and $\tau_2$. In general, a mildly nonlinear time-varying
system has the following Volterra series expansion:

$$\begin{aligned} y(t) = {} & \int_{-\infty}^{\infty} h_1(t,\tau_1)\,x(\tau_1)\,d\tau_1 \\ & + \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} h_2(t,\tau_1,\tau_2)\,x(\tau_1)x(\tau_2)\,d\tau_1\,d\tau_2 \\ & + \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} h_3(t,\tau_1,\tau_2,\tau_3)\,x(\tau_1)x(\tau_2)x(\tau_3)\,d\tau_1\,d\tau_2\,d\tau_3 + \cdots \end{aligned} \qquad (5.59)$$


Recall  that  in  the  linear  time-invariant  case,  the  impulse  response  may  be  found 
in  either  the  time  domain  or  the  frequency  domain.  In  fact,  the  frequency  domain 
solution  is  often  much  easier  to  obtain  than  the  time  domain  impulse  response. 
However,  for  time-varying  systems,  frequency  response  is  no  longer  well  defined, 
and  hence  it  is  necessary  to  apply  impulses  at  the  system  input  and  find  the  output 
expression  in  the  time  domain.  This  amounts  to  solving  the  differential  equation 
directly,  which  is  not  trivial  in  general.  But  for  first-order  differential  equations, 
such  as  the  sampling  mixer  equation,  this  method  is  feasible. 

Suppose that in (5.50), g varies with time. In particular, g goes to 0 when the
transistor turns off. The zero-state response of the first-order equation

$$g v_1 + C\frac{dv_1}{dt} = g V_{rf}$$

is obtained using an integrating factor:

$$v_1(t) = \int_{-\infty}^{t} e^{-\int_{\mu}^{t}\frac{g(\xi)}{C}\,d\xi}\;\frac{g(\mu)}{C}\,V_{rf}(\mu)\,d\mu \qquad (5.60)$$

Fig.  5.17    Composition  of  second-order  system  from  first-order  systems

Setting $V_{rf}(\mu) = \delta(\mu - \tau)$ gives the impulse response of the first-order system:

$$h_1(t,\tau) = \begin{cases} e^{-\int_{\tau}^{t}\frac{g(\xi)}{C}\,d\xi}\;\dfrac{g(\tau)}{C} & \text{if } t > \tau \\ 0 & \text{if } t < \tau \end{cases} \qquad (5.61)$$
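
The kernel of (5.61) can be checked numerically. The sketch below assumes the linearly falling conductance g(t) = -βt for t ≤ 0 that is adopted later in this section, for which the exponent in (5.61) with t = 0 evaluates to βτ²/(2C). It then confirms that the kernel observed at t = 0 integrates to unity over the launch time, as expected for a track-and-hold with unity DC gain. The function names and the numerical values of β and C are hypothetical.

```python
import math


def h1_sampled(tau: float, beta: float, c: float) -> float:
    """h1(0, tau) for the assumed ramp g(t) = -beta * t, t <= 0.

    The exponent is the integral of g/C from tau to 0: beta * tau**2 / (2 * C).
    """
    if tau > 0.0:
        return 0.0  # causal: no response to impulses launched after t = 0
    g_tau = -beta * tau
    return (g_tau / c) * math.exp(-beta * tau * tau / (2.0 * c))


def dc_gain(beta: float, c: float, n: int = 20000) -> float:
    """Trapezoidal integral of h1(0, tau) over tau in [-10/kappa, 0]."""
    kappa = math.sqrt(beta / (2.0 * c))
    lo = -10.0 / kappa
    step = -lo / n
    total = 0.0
    for i in range(n + 1):
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * h1_sampled(lo + i * step, beta, c)
    return total * step


# Hypothetical values: beta in S/s, C in F.
print(dc_gain(beta=1e8, c=1e-13))
```

The kernel is concentrated within a few multiples of sqrt(2C/β) before the sampling instant, which is why only the behaviour of g close to turn-off matters.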


Next, we calculate the second-order kernel. In general, a second-order system
can be thought of as the composition of linear systems as shown in Figure 5.17. In
particular, if the impulse responses of the linear systems are $h_a(t,\tau)$, $h_b(t,\tau)$, and
$h_c(t,\tau)$ respectively, the second-order kernel can be computed as follows:

$$h_2(t,\tau_1,\tau_2) = \int_{-\infty}^{\infty} h_c(t,\tau)\,h_a(\tau,\tau_1)\,h_b(\tau,\tau_2)\,d\tau \qquad (5.62)$$

The expression for the third-order system is similar. If in Figure 5.17 $h_a$ is second
order, and therefore the overall system is third order, then the overall system kernel is

$$h_3(t,\tau_1,\tau_2,\tau_3) = \int_{-\infty}^{\infty} h_c(t,\tau)\,h_a(\tau,\tau_1,\tau_2)\,h_b(\tau,\tau_3)\,d\tau \qquad (5.63)$$


Therefore, as was done with the time-invariant case, $h_2$ and $h_3$ can be solved
consecutively. For example, the second-order kernel is found by substituting $v_1(\mu)$
from (5.60) and setting the input as an impulse in the solution to the second-order
equation in (5.50):

$$v_2(t) = \int_{-\infty}^{t} e^{-\int_{\mu}^{t}\frac{g(\xi)}{C}\,d\xi}\;\frac{k}{2C}\left(v_1^2(\mu) - V_{rf}^2(\mu)\right)d\mu \qquad (5.64)$$

Since the differential equations are linear and first order, there is no theoretical
difficulty in obtaining the solution. Following this method, the complete time
domain solution to the system can be obtained. However, the mathematics is very
tedious.

Now, in a sampling mixer where the output points of interest are the sampled
points, as opposed to the continuous time output, it will be shown in the next
sub-section (by exploring the frequency domain interpretation and by taking
advantage of the fact that the output is in the sampled data domain) that the problem
can be simplified significantly. A closed-form solution can be obtained.
Simplification  of  time  varying  impulse  response  in  the  special  case:
periodic  time  varying  with  sampling

Suppose that sampling occurs at time 0. Let $h_1(t,\tau)$ be the time-varying kernel
of the system with a nonzero fall-time LO waveform. The linear term in
the sampled output voltage is

$$y(0) = \int_{-T}^{0} h_1(0,\tau)\,x(\tau)\,d\tau + ZIR \qquad (5.65)$$

where T is the sampling period, and ZIR is the zero-input response due to the initial
condition. Since we have assumed that the RC time constant for the sampling mixer
is much smaller than the sampling period, ZIR is negligible. This is equivalent to
saying that any poles/zeroes of the sampling mixer have frequencies much higher
than $f_{lo}$, or that (4.150), Chapter 4, is satisfied. Again, because of the small time
constant, we may replace the lower limit of integration in (5.65) by $-\infty$:

$$y(0) = \int_{-\infty}^{0} h_1(0,\tau)\,x(\tau)\,d\tau \qquad (5.66)$$

Next  we  make  a  few  observations. 

Observation  1: 

What is the value of y(T)? Replacing 0 by T in (5.66), we have

$$y(T) = \int_{-\infty}^{T} h_1(T,\tau)\,x(\tau)\,d\tau \qquad (5.67)$$

We do not know $h_1(T,\tau)$ in general. However, we do know that the system is
not only time varying, but periodically time varying with period T. This follows from
the fact that the LO waveform is periodic in T, which implies that the system is
periodic in T. Therefore

$$h_1(t + u,\,u) = h_1(t + u + T,\,u + T) \qquad (5.68)$$

which is the basic property of a periodic time-varying system.
Substituting (5.68) in (5.67), we have

$$y(T) = \int_{-\infty}^{T} h_1(0,\tau - T)\,x(\tau)\,d\tau \qquad (5.69)$$

Let us redefine $\tau_1 = \tau - T$:

$$y(T) = \int_{-\infty}^{0} h_1(0,\tau_1)\,x(\tau_1 + T)\,d\tau_1 \qquad (5.70)$$

Hence, y(T) can be obtained by using the same impulse response, $h_1$, that is used
to generate y(0).

Just to highlight the unique position that T assumes, we can see that y(T/2), for
example, cannot be obtained by this method. This is because

$$h_1(t + u,\,u) \neq h_1\!\left(t + u + \tfrac{T}{2},\,u + \tfrac{T}{2}\right)$$

The usefulness of expressing y(T) as in (5.70) will become even more apparent if
we start to express y(2T), y(3T), ... in a similar fashion. Following the preceding
method,

$$y(2T) = \int_{-\infty}^{0} h_1(0,\tau)\,x(\tau + 2T)\,d\tau$$

Now suppose we define a new system in which we collect y(0), y(T), y(2T), and so on
as the outputs, with x(0), x(T), x(2T), ... as the corresponding inputs. We
can see the following:

a.  They are related by the same convolution integral with the same impulse
function $h_1(0,\tau)$ and, most important, the impulse response is a one-dimensional
impulse response.

b.  The relationship is time invariant [e.g., as x(t) is shifted to x(t + T), it is the same
$h_1(0,\tau)$ that is used to generate y(t) and y(t + T)].

Therefore, this describes an LTI system. Extending this argument, we can
likewise transform a bilinear time-varying system to a bilinear time-invariant
system if we are interested only in the sampled points. Therefore, if $h_2(t,\tau_1,\tau_2)$
describes a bilinear periodic time-varying system, the sampled points can be
described by the equivalent bilinear time-invariant system $h_2(0,\tau_1,\tau_2)$.

Observation  2: 

Since the output voltage of interest is at the sampling instant, the time-varying
characteristic of importance occurs during the time within a few time constants
before the sampling instant. This means that the same Volterra kernel applies to
every sampling instant except that the kernels are shifted by nT, integer multiples of
the LO period. To simplify computation, we want to keep the Volterra kernel
identical for each sampling period. Therefore, instead of shifting the kernel, we
shift the input signal backward in time by nT and keep the sampling instant at time 0.
This effectively keeps the functional form of the kernel identical for all samples.
Further, there is nothing special about sampling points nT. Hence if sampling occurs
at an arbitrary time t, the sampled voltage can be represented as

$$y(t) = \int_{-\infty}^{0} h_1(0,\tau)\,x(\tau + t)\,d\tau \qquad (5.71)$$

This y(t) is a fictitious signal, whose value at t represents the sampled value of
the output if sampling were to occur at t. If sampling occurs at instants nT, the output
voltage samples are just y(T), y(2T), ..., y(nT). Hence the nonideal sampling of the
output signal is reduced to the ideal sampling of y(t), where y(t) is related to the
input x(t) by (5.71). This is convenient because harmonic distortion for the ideal
sampled y(nT) is the same as the harmonic distortion of the continuous signal y(t).
So the problem of calculating distortion in the nonideal sampled output is reduced to
the problem of calculating the distortion in the continuous signal y(t).

The transformation from (5.65) to (5.71) also reduces a time-varying system to a
time-invariant system, as the relation between x and y in (5.71) is time invariant.
This situation is analogous to the analysis of discrete control systems where,
although a zero-order hold is not a time-invariant operation in the continuous
time domain, the input-output relation is nevertheless time invariant in the sampled
data domain.

Now that we have made some key observations, it remains to find the impulse
response of the linear time-invariant system described in (5.71). Let us denote its
impulse response in the time and frequency domains by $h_1(t)$ and $H_1(\omega)$ respectively.
To find $h_1(t)$, set $x(t) = \delta(t)$ in (5.71). It follows that

$$h_1(t) = \begin{cases} h_1(0,-t) & \text{if } t \ge 0 \\ 0 & \text{otherwise} \end{cases} \qquad (5.72)$$

The frequency response is its Fourier transform:

$$H_1(\omega) = \int_{-\infty}^{0} h_1(0,\tau)\,e^{j\omega\tau}\,d\tau \qquad (5.73)$$

where again, $h_1(t,\tau)$ is the time-varying kernel.
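
The equivalence between (5.71) and (5.73) can be illustrated numerically. The sketch below assumes a normalised kernel $h_1(0,\tau) = -2\kappa^2\tau e^{-\kappa^2\tau^2}$ for τ ≤ 0 (the form that results from a linearly falling g, taken here with κ = 1 purely for illustration), drives (5.71) with x(t) = cos(ωt), and compares the result with Re{H1(ω) e^{jωt}} computed from (5.73). All names and values are hypothetical.

```python
import cmath
import math

KAPPA = 1.0  # hypothetical normalised kernel parameter


def kernel(tau: float) -> float:
    """Assumed sampled kernel h1(0, tau), valid for tau <= 0."""
    return -2.0 * KAPPA ** 2 * tau * math.exp(-(KAPPA * tau) ** 2)


def integrate(f, lo=-10.0, hi=0.0, n=20000):
    """Composite trapezoidal rule; works for real or complex integrands."""
    step = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for i in range(1, n):
        total += f(lo + i * step)
    return total * step


def y_sampled(t: float, omega: float) -> float:
    """Fictitious signal y(t) of (5.71) for the input x(t) = cos(omega * t)."""
    return integrate(lambda tau: kernel(tau) * math.cos(omega * (tau + t)))


def h1_freq(omega: float) -> complex:
    """Frequency response H1(omega) of (5.73)."""
    return integrate(lambda tau: kernel(tau) * cmath.exp(1j * omega * tau))


omega = 0.3
for t in (0.0, 1.0, 2.5):
    direct = y_sampled(t, omega)
    via_h1 = (h1_freq(omega) * cmath.exp(1j * omega * t)).real
    print(t, direct, via_h1)
```

The two columns agree for any sampling instant t, which is the time-invariance property exploited in the rest of the derivation.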

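The same technique applies to higher-order systems. For a second-order system,
the sampled data domain response is

$$y(t) = \int_{-\infty}^{0}\!\int_{-\infty}^{0} h_2(0,\tau_1,\tau_2)\,x(\tau_1 + t)\,x(\tau_2 + t)\,d\tau_1\,d\tau_2 \qquad (5.74)$$

and the frequency response is

$$H_2(\omega_1,\omega_2) = \int_{-\infty}^{0}\!\int_{-\infty}^{0} h_2(0,\tau_1,\tau_2)\,e^{j\omega_1\tau_1}e^{j\omega_2\tau_2}\,d\tau_1\,d\tau_2 \qquad (5.75)$$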
Again, the second-order kernel may be obtained from its composition from
first-order systems as in (5.62).

The frequency domain expression is of most interest. It is, in fact, possible to
bypass the time domain expression and obtain the frequency domain result directly.
Assuming a composition form as in Figure 5.17, substituting (5.62) into (5.75), and
noting t = 0, then

$$H_2(\omega_1,\omega_2) = \int_{-\infty}^{0}\!\int_{-\infty}^{0}\!\int_{-\infty}^{0} h_c(0,\tau_3)\,h_a(\tau_3,\tau_1)\,h_b(\tau_3,\tau_2)\,e^{j\omega_1\tau_1}e^{j\omega_2\tau_2}\,d\tau_3\,d\tau_1\,d\tau_2 \qquad (5.76)$$

More succinctly and for convenience only, define

$$H_1(\omega,t) = \int_{-\infty}^{t} h_1(t,\tau)\,e^{j\omega\tau}\,d\tau \qquad (5.77)$$

and

$$H_2(\omega_1,\omega_2,t) = \int_{-\infty}^{t}\!\int_{-\infty}^{t} h_2(t,\tau_1,\tau_2)\,e^{j\omega_1\tau_1}e^{j\omega_2\tau_2}\,d\tau_1\,d\tau_2 \qquad (5.78)$$

Then, again substituting (5.62) into (5.78) and simplifying the resulting expression
using (5.77), we arrive at the formula

$$H_2(\omega_1,\omega_2,t) = \int_{-\infty}^{t} h_c(t,\tau)\,H_a(\omega_1,\tau)\,H_b(\omega_2,\tau)\,d\tau \qquad (5.79)$$

In particular, setting t = 0 gives the sampled frequency domain kernel for the
system of Figure 5.17:

$$H_2(\omega_1,\omega_2) = \int_{-\infty}^{0} h_c(0,\tau)\,H_a(\omega_1,\tau)\,H_b(\omega_2,\tau)\,d\tau \qquad (5.80)$$


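The third-order frequency domain representation is similar. If $h_a$ in Figure 5.17 is
second order, then the resulting Volterra kernel becomes

$$H_3(\omega_1,\omega_2,\omega_3,t) = \int_{-\infty}^{t} h_c(t,\tau)\,H_a(\omega_1,\omega_2,\tau)\,H_b(\omega_3,\tau)\,d\tau \qquad (5.81)$$

and the sampled frequency domain kernel is

$$H_3(\omega_1,\omega_2,\omega_3) = \int_{-\infty}^{0} h_c(0,\tau)\,H_a(\omega_1,\omega_2,\tau)\,H_b(\omega_3,\tau)\,d\tau \qquad (5.82)$$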
Derivation  of  Volterra  coefficients  for  time-varying  distortion

In this sub-section, we analyze the sampling mixer of Figure 5.16 in the sampled
data domain. Assuming that the input side is the source and the output side the drain, from the basic
MOS equation, we have

$$k(V_g - V_s - V_t)(V_d - V_s) - \frac{k}{2}(V_d - V_s)^2 + C\frac{dV_d}{dt} = 0 \qquad (5.83)$$

(Assuming the output side to be the source gives a slightly different differential
equation, but the form of the answer is the same.)

Remember the M1 conductance $g = k(V_{gs} - V_t)$. M1 enters the cut-off region at
$V_{gs} - V_t = 0$, so the value of g goes to zero at each sampling point. Now, since we
have assumed that the RC time constant is small, the region of most importance is
the time right before the sampling instant. Therefore, we can approximate g by a
linear function prior to each sampling point. This assumption is most valid if the
system time constant is smaller than the fall time of the gate voltage. In practical
cases where this is not true, the analysis yields a limiting case and provides a useful
bound. Now, the slope of the linearly varying g is just the slope of the falling edge of
$V_g$ minus the slope of the input signal $V_s$, to a first-order approximation. So if we
define $\beta = 2kV_G/T_f$, where $V_G$ and $T_f$ are as indicated in Figure 5.16, and define
$g = -\beta t$, the MOS equation (5.83) becomes

$$\left(g - k\frac{dV_s}{dt}\,t\right)(V_d - V_s) - \frac{k}{2}(V_d - V_s)^2 + C\frac{dV_d}{dt} = 0 \qquad (5.84)$$


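Let $V_s = V_{rf}$, $V_d = V_{if}$, expand $V_{if}$ into its Volterra series as before, and collect the
first- to third-order terms. We have

$$\begin{aligned} g v_1 + C\frac{dv_1}{dt} &= g V_{rf} \\ g v_2 + C\frac{dv_2}{dt} &= k\frac{dV_{rf}}{dt}\,t\,(v_1 - V_{rf}) + \frac{k}{2}(v_1 - V_{rf})^2 \\ g v_3 + C\frac{dv_3}{dt} &= k\frac{dV_{rf}}{dt}\,t\,v_2 + k(v_1 - V_{rf})\,v_2 \end{aligned} \qquad (5.85)$$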


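To calculate $H_1$, we use (5.77), where the impulse response $h_1(t,\tau)$ has already
been solved in (5.61):

$$\begin{aligned} H_1(\omega,t) &= \int_{-\infty}^{t} h_1(t,\tau)\,e^{j\omega\tau}\,d\tau \\ &= \int_{-\infty}^{t} e^{-\int_{\tau}^{t}\frac{g(\xi)}{C}\,d\xi}\;\frac{g(\tau)}{C}\,e^{j\omega\tau}\,d\tau \\ &= \int_{-\infty}^{t} e^{-\kappa^2(\tau^2 - t^2)}\left(-2\kappa^2\tau\right)e^{j\omega\tau}\,d\tau \end{aligned} \qquad (5.86)$$

where the substitutions $g = -\beta t$ and $\kappa^2 = \beta/(2C)$ are made. This integral does not
have a closed-form solution. However, we are interested in the value of the integral
when t is close to 0, so the upper limit of the integral is close to zero. The integrand
is a rapidly increasing function of $\tau$ as $\tau$ approaches 0, so, effectively, the only
relevant portion of the integration is when $\tau$ is close to 0. Therefore, we can assume
that $e^{j\omega\tau} \approx 1 + j\omega\tau$. In this case, the integral becomes

$$H_1(\omega,t) = 1 + j\omega t - j\omega\cdot\frac{\sqrt{\pi}}{2}\cdot\frac{1}{\kappa}\cdot e^{\kappa^2 t^2}\left(\mathrm{erf}(\kappa t) + 1\right) \qquad (5.87)$$

The error function is defined as $\mathrm{erf}(t) = \frac{2}{\sqrt{\pi}}\int_0^t e^{-x^2}\,dx$.
So, setting t = 0,

$$H_1(\omega) = 1 - j\omega\cdot\frac{\sqrt{\pi}}{2}\cdot\frac{1}{\kappa} \qquad (5.88)$$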
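
The quality of the first-order approximation can be checked numerically. The sketch below assumes the kernel form of (5.86) at t = 0, namely $e^{-\kappa^2\tau^2}(-2\kappa^2\tau)e^{j\omega\tau}$, integrates it by the trapezoidal rule, and compares the result with the closed form $1 - j\omega\sqrt{\pi}/(2\kappa)$ of (5.88). The function names and the values of κ and ω are hypothetical, chosen so that ω/κ is small.

```python
import cmath
import math


def h1_exact(omega: float, kappa: float, n: int = 20000) -> complex:
    """Trapezoidal evaluation of (5.86) at t = 0 over tau in [-10/kappa, 0]."""
    lo = -10.0 / kappa
    step = -lo / n
    total = 0.0 + 0.0j
    for i in range(n + 1):
        tau = lo + i * step
        weight = 0.5 if i in (0, n) else 1.0
        envelope = math.exp(-(kappa * tau) ** 2) * (-2.0 * kappa ** 2 * tau)
        total += weight * envelope * cmath.exp(1j * omega * tau)
    return total * step


def h1_approx(omega: float, kappa: float) -> complex:
    """First-order result of (5.88): 1 - j * omega * sqrt(pi) / (2 * kappa)."""
    return 1.0 - 1j * omega * math.sqrt(math.pi) / (2.0 * kappa)


kappa = 2.0e10                 # hypothetical value, in 1/s
omega = 2.0 * math.pi * 1.0e8  # 100 MHz, so omega / kappa is about 0.03
print(h1_exact(omega, kappa), h1_approx(omega, kappa))
```

The residual difference is of second order in ω/κ, so the approximation degrades as the input frequency approaches κ.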


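The second-order term is evaluated using (5.79), where the time-varying kernel
is obtained from its composition from first-order terms. Mechanically, $v_1$ is
substituted by $H_1$, and $V_{rf}$ is substituted by $e^{j\omega\tau}$:

$$\begin{aligned} H_2(\omega_1,\omega_2,t) = {} & \int_{-\infty}^{t} e^{-\int_{\tau}^{t}\frac{g(\xi)}{C}\,d\xi}\;\frac{1}{C}\;k\cdot j\omega_1\cdot e^{j\omega_1\tau}\cdot\tau\cdot\left(H_1(\omega_2,\tau) - e^{j\omega_2\tau}\right)d\tau \\ & + \int_{-\infty}^{t} e^{-\int_{\tau}^{t}\frac{g(\xi)}{C}\,d\xi}\;\frac{1}{C}\;\frac{k}{2}\left(H_1(\omega_1,\tau) - e^{j\omega_1\tau}\right)\left(H_1(\omega_2,\tau) - e^{j\omega_2\tau}\right)d\tau \end{aligned} \qquad (5.89)$$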

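Again, approximating $e^{j\omega\tau}$ by its Taylor expansion and substituting (5.87) into (5.89):

$$\begin{aligned} H_2(\omega_1,\omega_2,t) = {} & \frac{\omega_1\omega_2}{C}\cdot k\cdot\frac{\sqrt{\pi}}{2}\cdot\frac{1}{\kappa}\cdot e^{\kappa^2 t^2}\int_{-\infty}^{t}\tau\left(\mathrm{erf}(\kappa\tau) + 1\right)d\tau \\ & - \frac{\omega_1\omega_2}{C}\cdot\frac{k}{2}\cdot\frac{\pi}{4}\cdot\frac{1}{\kappa^2}\cdot e^{\kappa^2 t^2}\int_{-\infty}^{t} e^{\kappa^2\tau^2}\left(\mathrm{erf}(\kappa\tau) + 1\right)^2 d\tau \end{aligned} \qquad (5.90)$$

Fortunately, at t = 0, the last two integrals may be evaluated numerically, and we
have: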


H2(o)\ ,  o)2)  =  -CO1CO2  [  ~  J  •  3 


(5.91) 
where b = 0.234 comes from the evaluation of definite integrals. (The relative
contributions from the two integrals are actually comparable.) Finally, $H_3$ is
obtained in the same way:


H3(a>i,  co2,  co3,  t) 


e-J>< 

— k  ■  jco\  ■  eJW,T  ■  x  ■  #2(0)1,  o»2,  t)  dx 

-k(Hl(co3,x)-ej(^)H2 


r 

x  (co1,a>2,x)  dx  (5.92) 

We need to evaluate the above expression for t = 0. Because the $H_2$ term contains
two integrals, there are in total four definite integrals. This is tedious but not
impossible. Numerically, the answer turns out to be:

H3(coi,  o)2,  co3)  =jcoico2c0i  ■  f-J   •—  (5.93) 

where c = 0.0913 numerically. Equations (5.88), (5.91) and (5.93) are the final
solutions to the sampled time-varying Volterra series. To summarize the results in
terms of circuit parameters,


3/2 
Hi{co\,  0)2)  =  -CO1CO2  ■  \jy(  ^j-\      -b  (5.95) 


k\VG) 


Hs((Oi,  m2,  co3)  =ja)lw2m3  ■  \J-j-\tJ-)      'c  (5-96) 

where again a = 0.866, b = 0.234, c = 0.0913 numerically. Recall that this result is obtained by assuming a linearly decreasing g. In reality, g is nearly constant for a large part of the sampling period. Therefore, this solution is an asymptotic case. The closed-form solutions for $HD_3$ and $IM_3$, assuming a linear falling edge, can now be derived and are shown in (5.97) and (5.98):

A^col  f7t\  3/2  [C\  1/2  /  Tf\  5/2  c 
HD3  =  -j-^-^  (-)      [-)      l-f)      -5  (5.97) 


v4/       \kj       \VGJ 
where $T_f$ is the fall time and $\delta$ is a constant with a numerical value of 0.1036.

If we compare the Volterra coefficients for the nonideal MOS sampling mixer with the ideal sampling case, we find that the nonlinearity due to the time-varying nature is much smaller than that of the ideal sampling case at low frequency and exceeds the other nonlinearity at high frequency. Notice that the time-varying distortion varies with the cube of frequency, so theoretically at high frequency the time-varying distortion may overtake the continuous time distortion. For a typical sampling mixer design this happens at an $f_{rf}$ of around 1 GHz. The fact that time-varying distortion is small at lower frequency can be explained intuitively. There are two sources of nonlinearity from the $M_1$ governing equation:

$$I_d = k\left[(V_{gs} - V_t)V_{ds} - \tfrac{1}{2}V_{ds}^2\right] \tag{5.99}$$

An obvious source of nonlinearity is the square term $V_{ds}^2$, but this effect is secondary for the ideal sampling case, where the major contributing factor is the input-signal-dependent conductance $k(V_{gs} - V_t)$. However, in a nonideal sampling mixer, the first-order signal dependence in g is eliminated because the sampling always occurs when $V_{gs} - V_t = 0$; hence the local behavior of g prior to cutoff is approximately the same for each sampling point. The only nonlinear factors left are then the signal dependence in the derivative of the conductance and the $V_{ds}^2$ term, which are significantly smaller. However, we are not really getting something for free here. Although in the case of the nonideal mixer the $k(V_{gs} - V_t)$ term causes much less input dependence in g, the $V_{gs}$ term manifests as an altogether different source of distortion. Because cutoff occurs when $V_{gs} - V_t = 0$, the cutoff time is now input signal dependent. In other words, the distortion in the amplitude domain is translated into the time domain by nonideal sampling. The signal-dependent cutoff time is called sampling error, which is the familiar distortion caused by finite fall time reported for an SAH circuit. Its effect is similar in the sampling mixer case. For the sake of consistency, we will rederive it using the Volterra series in the next sub-section. The result should be similar to that reported in the literature.


5.9.2.2     Sampling  Distortion 

In the development of time-varying distortion analysis in the sampled data domain, the output voltage is assumed to be sampled at equally spaced instants. This assumption is not exactly true given that the gate voltage has a nonzero fall time. The instant of sampling is when $V_{gs} - V_t = 0$, so the time at which sampling occurs depends not only on the gate voltage but also on the input voltage. The signal-dependent sampling time introduces an additional distortion term, which can be quantified using the Volterra series again. (Strictly speaking, this distortion can be calculated without the use of the Volterra series. The Volterra series is used here mainly for consistency.)
Referring to Figure 5.18, suppose that the input signal $V_{rf}$ is smooth, and we intend to sample the input at time 0. Let the gate voltage $V_{lo}$ have a finite-slope falling edge with slope a, where $a = 2V_G/T_f$, as shown in Figure 5.18. The falling edge
Fig. 5.18    Diagram showing sampling error


has the form $V_{lo}(t) = V_g - at$. Since the MOS transistor enters the cutoff region at $V_g - V_s = V_t$, and the input is applied at the source side, sampling occurs at the instant $T_s$ when $V_{lo}(T_s) - V_t = V_{rf}(T_s)$. Now, using the approximation $V_{rf}(T_s) \approx V_{rf}(0) + V'_{rf}(0)T_s$ and writing f for $V_{rf}$, we can solve for the cutoff time:


$$T_s = \frac{V_g - V_t - f(0)}{a + f'(0)} \approx \frac{V_g - V_t - f(0)}{a}\left[1 - \frac{f'(0)}{a} + \frac{f'(0)^2}{a^2}\right] \tag{5.100}$$


where we assume that a is large. Next, substituting the above expression for $T_s$ into the Taylor expansion for $V_{rf}(T_s)$ around 0 and collecting terms up to third order, we have:


$$V_{rf}(T_s) = V_{rf}(0) - \frac{V'_{rf}(0)V_{rf}(0)}{a} + \frac{V'_{rf}(0)^2\,V_{rf}(0)}{a^2} + \frac{V_{rf}(0)^2\,V''_{rf}(0)}{2a^2} \tag{5.101}$$


where again the assumption of large a is used. Therefore, due to the finite fall time, the sampled value $V_{rf}(T_s)$ differs from the desired value $V_{rf}(0)$ by additional signal-dependent terms, which produce distortion. Since the sampling time is arbitrary, (5.101) is valid if time 0 is replaced by an arbitrary sampling time.
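As a quick numerical check of the expansion in (5.101), the sketch below solves for the signal-dependent cutoff instant exactly and compares the sampled value with the series. It is only an illustration under stated assumptions: the time origin is placed where the gate ramp crosses the threshold (so the constant offset $(V_g - V_t)/a$ is dropped), the input is a single sinusoid, and the amplitude, frequency, phase, and slope values are hypothetical.

```python
import math

def sampled_value(A, w, phi, a, iters=20):
    """Exact and series-approximated sample of f(t) = A*cos(w*t + phi).

    Assumption (not from the text): the gate ramp crosses the threshold
    at t = 0, so cutoff occurs when -a*Ts = f(Ts).
    """
    f = lambda t: A * math.cos(w * t + phi)
    df = lambda t: -A * w * math.sin(w * t + phi)
    # Newton iteration on g(T) = f(T) + a*T = 0, starting from T = 0.
    T = 0.0
    for _ in range(iters):
        T -= (f(T) + a * T) / (df(T) + a)
    exact = f(T)
    f0, d1, d2 = f(0.0), df(0.0), -w * w * f(0.0)
    # Series up to third order in the signal, valid for a large slope a.
    series = f0 - d1 * f0 / a + d1 * d1 * f0 / a**2 + f0 * f0 * d2 / (2 * a**2)
    return T, exact, series, f0

# Hypothetical numbers: 0.1 V, 100 MHz input; slope a = 1.25e10 V/s.
T_s, exact, series, ideal = sampled_value(0.1, 2 * math.pi * 100e6, 0.7, 1.25e10)
print(f"Ts = {T_s:.3e} s")
print(f"exact sample  = {exact:.9f} V")
print(f"series (5.101) = {series:.9f} V")
print(f"ideal sample  = {ideal:.9f} V")
```

With a large slope the series tracks the exact sample far more closely than the ideal value $V_{rf}(0)$ does, which is the sense in which the extra terms represent the sampling error.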

The signal dependence can be analyzed by the Volterra series. The linear term in (5.101) is $V_{rf}(0)$, so $H_1 = 1$. The second-order term is the product of the derivative with the function itself. Using composition from linear terms and using symmetrization, the Volterra series coefficient is found to be


$$H_2(\omega_1, \omega_2) = -\frac{j(\omega_1 + \omega_2)}{2a} \tag{5.102}$$


and  for  the  third  order, 


$$H_3(\omega_1, \omega_2, \omega_3) = -\frac{(\omega_1 + \omega_2 + \omega_3)^2}{6a^2} \tag{5.103}$$
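Table 5.1    Summary of distortion behavior of a single-ended sampling mixer

Sources | $HD_3$ | $IM_3$
Continuous time distortion | $\frac{A_{rf}^2}{4}\cdot\frac{j\omega_{rf}C}{k(V_G - V_t)^3}$ | Same as $HD_3$
Time varying distortion | $\frac{A_{rf}^2\omega_{rf}^3}{8}\left(\frac{\pi}{4}\right)^{3/2}\left(\frac{C}{k}\right)^{1/2}\left(\frac{T_f}{V_G}\right)^{5/2}\cdot 0.0913$ | $3 \cdot HD_3$
Sampling distortion | $\frac{3}{8}\left(\frac{A_{rf}\,\omega_{rf}T_f}{2V_G}\right)^2$ | $\frac{1}{8}\left(\frac{A_{interference}\,\omega_{rf}T_f}{2V_G}\right)^2$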

where $\omega_1$, $\omega_2$, $\omega_3$ are the frequencies of the input signals. If the mixer is operating at $\omega_{rf}$, using the proper definitions, harmonic distortion and intermodulation can be calculated exactly:

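$$HD_2 = \frac{A_{rf}}{4}\left(\frac{\omega_{rf}T_f}{V_G}\right) \tag{5.104}$$

$$HD_3 = \frac{3A_{rf}^2}{32}\left(\frac{\omega_{rf}T_f}{V_G}\right)^2 \tag{5.105}$$

$$IM_2 = 0 \tag{5.106}$$

$$IM_3 = \frac{A_{interference}^2}{32}\left(\frac{\omega_{rf}T_f}{V_G}\right)^2 \tag{5.107}$$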
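The step from the Volterra kernels to the distortion figures can be sketched numerically. The snippet below is an illustration only: it assumes $H_1 = 1$, the kernels $H_2 = -j(\omega_1+\omega_2)/(2a)$ and $H_3 = -(\omega_1+\omega_2+\omega_3)^2/(6a^2)$, a ramp slope $a = 2V_G/T_f$, and the usual single-tone and two-tone relations between kernels and HD2, HD3, IM3. The function name and the sample values are hypothetical.

```python
import math

def sampling_distortion(A, w, V_G, T_f):
    """HD2, HD3 and IM3 of the sampling error from its Volterra kernels.

    Assumptions: H1 = 1, H2 = -j(w1+w2)/(2a), H3 = -(w1+w2+w3)^2/(6a^2),
    ramp slope a = 2*V_G/T_f, and two closely spaced tones for IM3.
    """
    a = 2.0 * V_G / T_f
    H2 = lambda w1, w2: -1j * (w1 + w2) / (2 * a)
    H3 = lambda w1, w2, w3: -((w1 + w2 + w3) ** 2) / (6 * a * a)
    hd2 = 0.5 * A * abs(H2(w, w))           # second harmonic over fundamental
    hd3 = 0.25 * A**2 * abs(H3(w, w, w))    # third harmonic over fundamental
    im3 = 0.75 * A**2 * abs(H3(w, w, -w))   # 2*w1 - w2 product with w1 ~ w2
    return hd2, hd3, im3

A, w, V_G, T_f = 0.316, 2 * math.pi * 100e6, 2.5, 0.4e-9
hd2, hd3, im3 = sampling_distortion(A, w, V_G, T_f)
x = w * T_f / V_G
print("HD2:", hd2, "closed form:", A * x / 4)
print("HD3:", hd3, "closed form:", 3 * A**2 * x**2 / 32)
print("IM3:", im3, "closed form:", A**2 * x**2 / 32)
```

Under these assumptions the kernel-based values coincide with the closed forms quoted in (5.104), (5.105) and (5.107).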

Comparing the sampling distortion as given by (5.104) through (5.107) with the continuous time distortion as given by (5.54) through (5.57), it can be shown that the sampling distortion becomes comparable to the continuous time distortion when the fall time is on the order of $\sqrt{\tau T_{rf}}$, where $\tau$ is the RC constant formed by the MOS resistor and the load capacitor, and $T_{rf}$ is the period of the input signal $V_{rf}$. Therefore, sampling error is a bigger problem at high frequency.

Table  5.1  summarizes  the  distortion  behavior  of  an  unbalanced  sampling  mixer. 

For any given $T_f$ we have three different sources of distortion, as summarized in Table 5.1. We should calculate the $IM_3$ from each source for this given $T_f$ and compare them. Whichever is the largest dominates, and we call that the $IM_3$ of the mixer.


5.9.3    High-Frequency  Case  in  Bottom-Plate  Single-Ended 
Sampling  Mixer 

The preceding analysis is now applied to the bottom-plate sampling mixer as shown in Figure 5.13. First Figure 5.13 is redrawn in Figure 5.19 to show the parasitic capacitance. Here $C_j$, $C_p$ are the parasitic capacitances associated with $M_1$ and $M_2$, respectively.
Fig. 5.19    Bottom-plate single-ended sampling mixer with parasitic capacitance

The  derivation  of  the  distortion  formula  is  given  in  [1].  We  skip  the  details  and 
just  present  the  final  closed-form  solutions.  For  typical  MOS  device  parameters, 
except at frequencies higher than 1 GHz, we neglect the time-varying distortion. It is seen that as the bottom switch $M_2$ opens, the sampling distortion is small. This is because the source and drain of the bottom switch are kept almost constant at the ground level, which eliminates signal-dependent sampling error. Therefore, distortion is due mainly to continuous time distortion. When the top switch opens, the capacitance seen is primarily the parasitic capacitance. With the much smaller capacitance, the continuous time distortion is small. The sampling distortion, however, is the same as in the case of the single-ended simple sampling mixer (Figure 5.10) and hence dominates. Therefore, the total distortion is a combination of the two (each weighted by the proper capacitor ratios due to the capacitor divider) as follows:


HDt, 


IM3  = 


Chad        \ 

Al  mf(Ci0ad  +  Cj)     /     cp     \ 

34 

32 

fcQrfTfV 

{-'load    <    ^pj 

4          *l(VG-VT)3             KCload  +  Cp) 

V  vG  J 

(5.108) 

I-' load 
{-■load    \    *-f 

A2 

interference 

32 


A2 

interference 


(OrfTi 


oad 


c, 


j(orf(Ci, 

ki(VG  +  VT)3 


C„ 


c, 


load 


C, 


V1f 
VG 


(5.109) 
These formulas are similar to the case of a single-ended simple sampling mixer. However, the major difference is that the distortion due to sampling (when the top switch opens) is reduced by a factor of $\frac{C_p}{C_{load} + C_p}$, which is significant. Therefore, sampling distortion is less of a problem in a bottom-plate sampling mixer than in the simple sampling mixer. Finally, for a bottom-plate subsampled mixer, the distortion formulas remain the same.


5.10     Intrinsic  Noise  in  Single-Ended  Sampling  Mixer 
5.10.1     $V_{lo}$ not Switching Case

The noise calculation is just a repetition of the switching mixer, except that instead of terminating the mixer with $R_{load}$, it is now terminated with $C_{load}$. All the formulae can be rederived with this change. We will skip the details and present the final answers.

In this sub-section we follow the notation developed in sub-section 5.5.1. We assume that the switch is on and hence the output noise PSD is given by

$$S_n = 4kTR_{on} \tag{5.110}$$

Input referred noise, $N_{dev}$, is then given by

Ndev=^  (5.111) 

where  Sn  is  given  by  (5.110)  and  Gc  is  given  by  (5.43).  Hence  NF  can  be  calculated 
by  substituting  the  Ndev  so  obtained  in  (5.35). 

As a side note, notice that the noise bandwidth $f_b$ is given as

$$f_b = \frac{\pi}{2}\times(\text{bandwidth of RC filter}) = \frac{\pi}{2}\times\frac{1}{2\pi R_{on}C_{load}} = \frac{1}{4R_{on}C_{load}} \tag{5.112}$$

If we integrate $S_n$ as given by (5.110) throughout the entire bandwidth of interest ($= f_b$) to find the variance of the noise, $\sigma_n^2$, we note that $R_{on}$ is canceled and that $\sigma_n^2$ is given as

$$\sigma_n^2 = \int_0^{f_b} S_n\,df = f_b \times 4kTR_{on} = \frac{1}{4R_{on}C_{load}}\times 4kTR_{on} = \frac{kT}{C_{load}} \tag{5.113}$$
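The cancellation of $R_{on}$ in (5.113) is easy to confirm numerically. The sketch below uses the load capacitance from Numerical example 5.2; the temperature of 300 K and the list of $R_{on}$ values are assumptions chosen only for illustration.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K

def ktc_noise(R_on, C_load, T=300.0):
    """Noise bandwidth (5.112) and integrated noise variance (5.113)."""
    f_b = 1.0 / (4.0 * R_on * C_load)   # equivalent noise bandwidth, Hz
    S_n = 4.0 * K_B * T * R_on          # white PSD of the switch resistance, V^2/Hz
    return f_b, f_b * S_n               # variance in V^2

C_LOAD = 0.352e-12  # F, from Numerical example 5.2
for R_on in (10.0, 50.15, 500.0):       # hypothetical on-resistances, ohms
    f_b, var = ktc_noise(R_on, C_LOAD)
    print(f"R_on = {R_on:7.2f} ohm  f_b = {f_b:.3e} Hz  "
          f"sigma_n = {math.sqrt(var) * 1e6:.1f} uV rms")
```

Changing $R_{on}$ moves the noise bandwidth but leaves the rms noise unchanged, since the variance depends only on $kT/C_{load}$.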



5.10.2     $V_{lo}$ Switching, Both Special and General Cases

First the output noise PSD in the special $V_{lo}$ switching case is presented. The output noise PSD in the general $V_{lo}$ switching case will be studied in Q7.

Let us follow the notation in sub-section 5.5.2 and Figure 5.7. We also represent the sampling mixer by Figure 5.7. With $V_{lo}$ switching, the noises at node 1 in Figure 5.7 are just aliased on top of one another to form the noise at node 2. The PSD of noise at node 1 is given by (5.110) and repeated as

$$S_{n1} = 4kTR_{on} \tag{5.114}$$

The noise bandwidth is given as $f_b$ and the switching frequency is given by $f_{lo}$. Due to aliasing, all noises centered at frequencies that are around multiples of $f_{lo}$ and below $f_b$ will fall on top of one another. Hence the output noise PSD will increase by a factor of $f_b/f_{lo}$. In addition, due to switching there is a difference in conversion gain that has to be accounted for. Comparing (5.44) with (5.43), we note that because of switching the conversion gain is reduced by $1/\pi$. In summary, the output noise PSD at node 2 can be calculated by applying an increase of $f_b/f_{lo}$ and a reduction of $1/\pi$ to $S_{n1}$ as given in (5.114). Hence $S_{n2}$ is given by [2]

$$S_{n2} = \frac{4kTR_{on}}{\pi}\times\left(\frac{f_b}{f_{lo}}\right) \tag{5.115}$$

We can also calculate the variance of the noise, $\sigma_{n2}^2$. First notice that $f_b$ is still $1/(4R_{on}C_{load})$. However, because of switching the entire bandwidth of interest is $f_{lo}$ and not $f_b$. If we substitute (5.112) into (5.115) and integrate $S_{n2}$ throughout the entire bandwidth of interest ($= f_{lo}$) to find the variance of the noise, $\sigma_{n2}^2$, we note that $R_{on}$ and $f_{lo}$ are canceled and that $\sigma_{n2}^2$ is

$$\sigma_{n2}^2 = \int_0^{f_{lo}} S_{n2}\,df = f_{lo}\times\frac{4kTR_{on}}{\pi}\times\left(\frac{f_b}{f_{lo}}\right) = \frac{4kTR_{on}}{\pi}\times\frac{1}{4R_{on}C_{load}} = \frac{1}{\pi}\times\frac{kT}{C_{load}} \tag{5.116}$$

Note that this is similar to the $\sigma_n^2$ given in (5.113) except that they differ by a factor of $1/\pi$ because of the reduction in conversion gain in a sampling mixer due to switching.

Next we can find the input referred noise, $N_{dev}$, by substituting $S_{n2}$ derived in (5.115) and $G_c$ derived in (5.44) into the following formula:

Ndevice  =  ^  (5.117) 

Substituting the $N_{dev}$ so obtained into (5.35) will allow us to get the NF.
As a final note, for a subsampling mixer the $f_b/f_{lo}$ factor in (5.115) is much larger than that for a normal sampling mixer. Hence $S_{n2}$ is substantially larger, which means that a subsampling mixer has a much poorer noise performance.
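The aliasing argument behind (5.115) and (5.116) can be illustrated with a few lines of arithmetic. The sketch below is hedged: the temperature (300 K), the on-resistance, and the two LO frequencies (one at the carrier rate, one much lower to mimic subsampling) are hypothetical values, and the $1/\pi$ factor is simply taken from the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K

def switched_noise(R_on, C_load, f_lo, T=300.0):
    """Folding factor, aliased PSD (5.115) and variance (5.116)."""
    f_b = 1.0 / (4.0 * R_on * C_load)                    # noise bandwidth (5.112)
    fold = f_b / f_lo                                    # number of aliased bands
    S_n2 = 4.0 * K_B * T * R_on / math.pi * fold         # output PSD, V^2/Hz
    var = f_lo * S_n2                                    # integrate over f_lo
    return fold, S_n2, var

R_ON, C_LOAD = 50.15, 0.352e-12
normal = switched_noise(R_ON, C_LOAD, f_lo=1.9e9)        # hypothetical LO at the RF rate
subsampled = switched_noise(R_ON, C_LOAD, f_lo=100e6)    # hypothetical subsampling LO
for name, (fold, S_n2, var) in (("normal", normal), ("subsampling", subsampled)):
    print(f"{name:12s} fold = {fold:7.1f}  S_n2 = {S_n2:.3e} V^2/Hz  var = {var:.3e} V^2")
```

The integrated variance is the same in both cases, but the subsampling mixer concentrates it into a narrower band, so its PSD is larger by the ratio of the two LO frequencies.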


5.11     Extrinsic  Noise  in  Single-Ended  Sampling  Mixer

There  are  two  primary  noise  sources  in  any  mixer.  One  is  the  intrinsic  noise  from 
the  mixer  itself,  which  has  been  the  primary  focus  of  our  discussion  in  Chapters  4 
and  5  so  far.  In  addition,  any  extrinsic  noise  from  the  RF/LO  port  will  directly  affect 
the  output  noise.  In  this  section  we  will  briefly  examine  this  extrinsic  noise  as  well. 
As  it  turns  out,  extrinsic  noise  from  the  RF  port  can  be  analyzed  in  much  the  same 
way  as  intrinsic  noise  of  the  mixer.  Accordingly,  we  concentrate  our  discussion  on 
the  extrinsic  noise  from  the  LO  port. 

Extrinsic  noise  from  the  LO  port  can  come,  for  example,  from  phase  noise  of  the 
frequency  synthesizer.  In  both  switching/sampling  mixers,  this  noise,  which  comes 
from  the  LO  port,  causes  the  MOS  conductance  to  fluctuate,  hence  producing  noise 
at  the  output  and  thus  degrading  the  NF  of  the  mixer.  Notice  that  this  will  happen 
even  if  we  assume  that  the  MOS  conductance  itself  does  not  have  thermal  noise; 
that is, there is no intrinsic noise. In addition, if there is adjacent channel interference at the input (RF port), this extrinsic noise from the LO port can mix this interference down to the same IF as the desired signal, thus degrading the SNR.

In this chapter we focus our discussion on the first impairment. The issue of the interference being mixed down by LO noise is investigated further in Chapter 8.


5.11.1     $V_{lo}$ not Switching Case

In  general,  without  switching,  the  mixer  (both  active  and  passive)  can  be  described 
as  an  LTI  circuit.  Hence  the  analysis  is  best  carried  out  in  the  frequency  domain. 
Therefore,  the  output  noise  PSD  at  the  IF  port  can  be  calculated  through  multiplying 
the  LO  noise  PSD  by  the  power  gain  from  the  LO  port  to  the  IF  port. 

Let us assume that the LO port has an input spectral density given by $S_{n\_lo\_input}$. Let us further assume that there is no RF signal. Then, as an example, $S_{n\_lo\_output}$, the output noise PSD of the sampling mixer described in Figure 5.10 due to this noise at the LO port, is given by


^n-lo-output  —  ^n-lO-input§m_triode\         /-*         /  (D.lloJ 


Here $g_{m\_triode}$ is the transconductance of $M_1$ operating in the triode region.
5.11.2     $V_{lo}$ Switching, Both Special and General Cases

Now let us take a look at the case when $V_{lo}$ is switching. For the Gilbert mixer and switching/sampling mixers, we can apply (4.134) of Chapter 4 and (5.36) of this chapter, respectively, to find the output noise PSD, $S_{n\_lo\_output}$. $S_{n2}$ in (4.134) of Chapter 4 and $S_{n1}$ in (5.36) are replaced by $S_{n\_lo\_input}$. Similarly, $S_{n3}$ in (4.134) of Chapter 4 and $S_{n2}$ in (5.36) are replaced by $S_{n\_lo\_output}$.


5.11.3     Special $V_{lo}$ Switching Case; $V_{lo}$ has Finite Rise and Fall Time

Even though $S_{n\_lo\_output}$ for switching/sampling and Gilbert mixers incorporating finite rise and fall time can all be derived, the $S_{n\_lo\_output}$ of the sampling mixer incorporating finite fall time is particularly important because its output value is determined exactly at the falling edge of the LO waveform. How do we derive this $S_{n\_lo\_output}$?

Let us first observe that for a sampling mixer, since the mixer works in the sampled domain, we are actually more interested in the variance of the output noise process at the sampling instant than in $S_{n\_lo\_output}$ throughout the entire LO period. We denote the noise process at the LO port as $N_{lo\_input}$ and the resulting output noise process at the IF port as $N_{lo\_output}$. We further denote the variance (or mean square voltage) at the LO port as $\sigma^2_{n\_lo\_input}$ and the resulting output variance (or mean square voltage) at the IF port as $\sigma^2_{n\_lo\_output}$. The variances have the unit V². It is $\sigma^2_{n\_lo\_output}$ that we are interested in.

Some important properties of this variance as well as $N_{lo\_output}$ can be derived using a time domain method based on the stochastic differential equations (SDE) approach. For example, $N_{lo\_output}$ is a cyclostationary process. This is not surprising as the mixer is an LPTV system. The output variance, $\sigma^2_{n\_lo\_output}$, varies at twice the input frequency at the RF port, $\omega_{rf}$.

For this mixer we assume that (4.150), Chapter 4, is satisfied. We present a few results on the LO noise of this mixer. Let us denote the sample time as 0 and let us assume that $V_{rf}$ is given by $V_{rf} = A_{rf}\cos(\omega_{rf}(t - t_0))$. Hence $V_{rf0}$ is the input signal at the sampling instant and is given by $V_{rf0} = A_{rf}\cos(\omega_{rf}t_0)$. The DC term of $\sigma^2_{n\_lo\_output}$, $\sigma^2_{n\_lo\_output\_dc}$, is given as [2]

a2  = Arf     C°rfC'oad     n2  fSIIQ") 

n_lo_output_dc  a      ,  f-.j  t/\2     n_lo_inpiit  y-J-liy) 

t     K(VG  —  Vt) 

Here  VG  is  the  gate  voltage. 

The important quantity, however, is the sampled output noise's variance due to the LO noise. We denote this as $\sigma^2_{n\_lo\_output,s}$. The sampling occurs during the falling edge of $V_{lo}$. During the falling edge $M_1$'s resistance rises, in proportion to the falling rate of the LO voltage. Now since there is noise on the LO port, the
Fig. 5.20    Receiver front end architecture for Numerical example 5.2


voltage source is noisy. This noisy voltage source not only causes the value of $R_{on}$ to increase, but also makes it fluctuate. The mixer output voltage tries to track the input voltage, but is governed by the $R_{on}C_{load}$ time constant. Since $R_{on}$ is now fluctuating, the output voltage, during tracking, fluctuates as well. At the sampling instant, this uncertainty is stored. This uncertainty, or noise, is the $\sigma^2_{n\_lo\_output,s}$ defined previously. Again let us denote the sample time as 0 and let us assume that $V_{rf}$ is given by $V_{rf} = A_{rf}\cos(\omega_{rf}(t - t_0))$. $\sigma^2_{n\_lo\_output,s}$ is given as [2]:


2 

n  Jo  -Output  ,5 


te-^te  I K^W>-«3  (5-120) 


where $T_f$ is the fall time and $V_G$ is the gate voltage. The preceding results for the sampling mixer [(5.119) and (5.120)] depend on the use of SDE. The derivations use a time domain (as opposed to a frequency domain) method. Due to the extensive background needed on SDE, detailed derivations of the preceding results lie outside the scope of this book (although it is interesting to note that part of the idea overlaps with that used to derive time-varying distortion using the time-varying Volterra series, discussed in sub-section 5.9.2.1). Interested readers are referred to [2] for the derivation.

Numerical  example  5.2. 

This  numerical  example  applies  formulas  derived  in  Sections  5.8  through  5.11. 
In  this  example,  we  calculate  Gc,  IM3,  and  NF  of  both  a  100-MHz  IF  single-ended 
sampling  mixer  and  a  1.9  GHz  RF  single-ended  sampling  mixer  in  a  DECT 
application.  The  architecture  under  consideration  is  shown  in  Figure  5.20.  This 
is  a  modified  form  of  the  heterodyne  architecture  discussed  in  Figure  2.13  of 
Chapter  2.  The  main  modification  consists  of  adding  a  second-stage  mixer  and  an 
A/D  converter.  The  filters  and  demodulator  are  not  shown. 

It is assumed that the input consists of a -83 dBm signal together with a -33 dBm interference.

The  following  parameters  are  assumed  in  the  mixer  calculation: 

C = 0.352 pF, W/L = 110 µm/0.8 µm, k = 100 µA/V², $V_G$ = 2.5 V, $V_t$ = 1.05 V, $T_f$ = 0.4 ns
The mixer input is terminated by a 50 Ω resistor. To simplify notation, G, $G_c$, $IM_3$,
NF,  N,  voltage,  and  power  levels  in  both  absolute  and  decibel  forms  are  represented 
by  the  same  symbols.  The  proper  interpretation  should  be  clear  from  the  context. 

Case  1:  implement  mixer  2  using  a  sampling  mixer 

Assumptions: Mixer 1 is a Gilbert mixer with power gain $G_{mixer1}$ = 0 dB; the IF amplifier is a continuous time amplifier with power gain $G_{IF\_amplifier}$ = 13 dB.

$\omega_{if} = 2\pi \times 100 \times 10^6$ rad/s

Since  mixer  2  takes  an  IF  input  and  generates  a  baseband  output,  the  input 
variables  will  have  IF  subscripts. 


IM3 

From Figure 5.20 we can see that at the input to mixer 2 we have two frequency components. One is the signal and the other is the interference, which is the one of interest for the $IM_3$ calculation. The power of the interference, denoted as $P_{interference\_if}$, is given by

$P_{interference\_if}$ = Input interference + $G_{lna}$ + $G_{mixer1}$ + $G_{IF\_amplifier}$ = -33 dBm + 20 dB + 0 dB + 13 dB = 0 dBm

Hence $A_{interference\_if}$ = 0.316 V
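The level bookkeeping above can be sketched in a few lines. The conversion assumes that the quoted amplitude is the peak voltage of a sinusoid delivering the stated power into 50 Ω; this assumption is not stated in the text, but it reproduces the 0.316 V figure. The function name is hypothetical.

```python
import math

def dbm_to_peak_volts(p_dbm, r_ohm=50.0):
    """Peak amplitude of a sinusoid that delivers p_dbm into r_ohm."""
    p_watts = 1e-3 * 10.0 ** (p_dbm / 10.0)
    return math.sqrt(2.0 * r_ohm * p_watts)

# Gain chain of Case 1, all values in dB: LNA, mixer 1, IF amplifier.
p_if_dbm = -33.0 + 20.0 + 0.0 + 13.0
a_if = dbm_to_peak_volts(p_if_dbm)
print(f"P_interference_if = {p_if_dbm:.1f} dBm, A_interference_if = {a_if:.3f} V")
```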

Following  Table  5.1,  we  have 

Continuous time distortion:

$$IM_3 = \frac{A_{interference\_if}^2}{4}\cdot\frac{j\omega_{if}C}{k(V_G - V_t)^3}$$

k = (100 µA/V²) × W/L = 100 µA/V² × (110 µm/0.8 µm) = 0.0138 A/V², $V_{GS} - V_t$ = 1.45 V

Substituting, $IM_3$ = -77.64 dB


Time-varying distortion:

$$IM_3 = \frac{3A_{interference\_if}^2\,\omega_{if}^3}{8}\left(\frac{\pi}{4}\right)^{3/2}\left(\frac{C}{k}\right)^{1/2}\left(\frac{T_f}{V_G}\right)^{5/2}\cdot 0.0913$$

Substituting, $IM_3$ = -120.3 dB.
Sampling distortion:

$$IM_3 = \frac{1}{8}\left(\frac{A_{interference\_if}\,\omega_{if}T_f}{2V_G}\right)^2$$

Substituting, $IM_3$ = -90 dB

Comparing, $IM_3$ is dominated by continuous time distortion and so $IM_3$ = -77.64 dB.
$G_c$ and NF

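We will calculate $G_c$ and NF for the $V_{lo}$ not switching case.

Applying (5.43) to the present case, $\omega_{rf}$ becomes $\omega_{if}$. Hence we have

$$G_c = \frac{\frac{1}{j\omega_{if}C}}{\frac{1}{j\omega_{if}C} + R_{on}}$$

From (5.11) $R_{on}$ is given by

$$R_{on} = \frac{1}{k(V_{GS} - V_t - V_{DS})} = \frac{1}{100\ \mu\text{A/V}^2 \times \frac{110\ \mu\text{m}}{0.8\ \mu\text{m}} \times (1.45\ \text{V} - 0\ \text{V})} \approx 50.15\ \Omega$$

Notice that we have assumed $V_{DS}$ to be practically 0 V since the sampling mixer is terminated by a capacitor. Substituting all the relevant parameters into the $G_c$ formula, we have $G_c$ = 0 dB.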

NF is given from (5.34) as:

$$NF = \frac{N_{device} + G\cdot N_{source\_resistance}}{G\cdot N_{source\_resistance}} = 1 + \frac{N_{device}}{G\cdot N_{source\_resistance}}$$

From (5.110), $N_{device} = 4kTR_{on}$ and $N_{source\_resistance} = 4kTR_s$. Since $G_c = 1$, $G = 1$. Substituting all these values into the NF formula, we have $NF = 1 + \frac{R_{on}}{R_s}$.

We are given that the input resistance is 50 Ω; hence $R_s$ = 50 Ω. Substituting, we have $NF \approx 1 + 1 = 2$, or 3 dB.


Case 2: implementing mixer 1 using a sampling mixer

In the present case $\omega_{rf} = 2\pi \times 1.9 \times 10^9$ rad/s. As a side note, in this case the IF amplifier is a sampled data amplifier and mixer 2 is another sampling mixer.


IM3 

The power of the interference is

$P_{interference\_rf}$ = Input interference + $G_{lna}$ = -33 dBm + 20 dB = -13 dBm

Following Table 5.1, we have

Continuous time distortion:

$$IM_3 = \frac{A_{interference\_rf}^2}{4}\cdot\frac{j\omega_{rf}C}{k(V_G - V_t)^3}$$

Substituting, $IM_3$ = -78.06 dB
Time-varying distortion:

$$IM_3 = \frac{3A_{interference\_rf}^2\,\omega_{rf}^3}{8}\left(\frac{\pi}{4}\right)^{3/2}\left(\frac{C}{k}\right)^{1/2}\left(\frac{T_f}{V_G}\right)^{5/2}\cdot 0.0913$$

Substituting, $IM_3$ = -69.6 dB.

Sampling distortion:

$$IM_3 = \frac{1}{8}\left(\frac{A_{interference\_rf}\,\omega_{rf}T_f}{2V_G}\right)^2$$

Substituting, $IM_3$ = -64.8 dB.

At this frequency sampling distortion dominates. Accordingly, $IM_3$ = -64.8 dB. Notice that when we go from case 1 to case 2, $A_{interference}$ has gone down, but $\omega$ has gone up. Since time-varying and sampling distortion increase as $\omega^3$ and $\omega^2$, respectively, they finally become dominant.
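The comparison of the three distortion sources in both cases can be reproduced with a short script. This is a sketch under stated assumptions rather than the book's own procedure: the formulas are those read from Table 5.1, the 3/8 prefactor of the time-varying term is inferred from the quoted results, amplitudes are peak volts in 50 Ω, ratios are converted with 20·log10, and k is taken as 100 µA/V² × W/L. The function names are hypothetical.

```python
import math

# Parameters from Numerical example 5.2
C = 0.352e-12               # load capacitance, F
K = 100e-6 * 110 / 0.8      # k = (100 uA/V^2) * W/L, A/V^2
V_G, V_T, T_F = 2.5, 1.05, 0.4e-9

def db(ratio):
    return 20.0 * math.log10(ratio)

def im3_sources(p_dbm, freq_hz):
    """IM3 (in dB) of each distortion source for an interferer of p_dbm."""
    A = math.sqrt(2 * 50 * 1e-3 * 10 ** (p_dbm / 10))   # peak volts in 50 ohm
    w = 2 * math.pi * freq_hz
    cont = A**2 / 4 * w * C / (K * (V_G - V_T) ** 3)
    # Assumed prefactor 3/8, inferred from the quoted -120.3 dB and -69.6 dB.
    tvar = (3 * A**2 * w**3 / 8 * (math.pi / 4) ** 1.5
            * (C / K) ** 0.5 * (T_F / V_G) ** 2.5 * 0.0913)
    samp = (A * w * T_F / (2 * V_G)) ** 2 / 8
    return {"continuous": db(cont), "time-varying": db(tvar), "sampling": db(samp)}

case1 = im3_sources(0.0, 100e6)     # mixer 2 at the 100 MHz IF
case2 = im3_sources(-13.0, 1.9e9)   # mixer 1 at the 1.9 GHz RF
for name, result in (("Case 1", case1), ("Case 2", case2)):
    dominant = max(result, key=result.get)
    levels = ", ".join(f"{k}: {v:.1f} dB" for k, v in result.items())
    print(f"{name}: {levels} -> dominated by {dominant}")
```

The largest (least negative) entry sets the mixer $IM_3$, which is how the text arrives at continuous time distortion for case 1 and sampling distortion for case 2.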

Gc and  NF 

Again we calculate $G_c$ and NF for the $V_{lo}$ not switching case. We apply (5.43) to the present case. Hence we have

$$G_c = \frac{\frac{1}{j\omega_{rf}C}}{\frac{1}{j\omega_{rf}C} + R_{on}}$$

Substituting, we have $G_c$ = 0.82 or -1.7 dB.

NF is given from (5.34) as

$$NF = \frac{N_{device} + G\cdot N_{source\_resistance}}{G\cdot N_{source\_resistance}} = 1 + \frac{N_{device}}{G\cdot N_{source\_resistance}}$$

$G_c$ = 0.82 means that G = 0.67. Substituting all the relevant values into this NF formula, we have $NF = 1 + \frac{R_{on}}{G R_s}$. With $R_s$ = 50 Ω and $R_{on}$ = 50.15 Ω, we have NF = 1 + 1.5 = 2.5, or 4 dB.
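The two noise figure results can be checked with the expression $NF = 1 + R_{on}/(G R_s)$, which follows from the NF formula once $N_{device} = 4kTR_{on}$ and $N_{source\_resistance} = 4kTR_s$ are inserted. In this sketch the power gains G = 1 and G = 0.67 are taken directly from the text rather than recomputed, and the function name is hypothetical.

```python
import math

def noise_figure(R_on, R_s, G):
    """NF = 1 + R_on / (G * R_s); returns (linear ratio, dB)."""
    nf = 1.0 + R_on / (G * R_s)
    return nf, 10.0 * math.log10(nf)

R_ON, R_S = 50.15, 50.0
nf1, nf1_db = noise_figure(R_ON, R_S, G=1.0)    # Case 1: G_c = 1
nf2, nf2_db = noise_figure(R_ON, R_S, G=0.67)   # Case 2: G_c = 0.82, G = 0.67
print(f"Case 1: NF = {nf1:.2f} ({nf1_db:.2f} dB)")
print(f"Case 2: NF = {nf2:.2f} ({nf2_db:.2f} dB)")
```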


5.12     Appendix:  Comparisons  of  Sample  and  Hold 
and  Sampling  Mixer 

There  is  an  existing  circuit  called  the  sample  and  hold  (SAH)  that  is  similar  to  the 
sampling  mixer.  Although  an  SAH  shares  many  of  the  properties  of  sampling 
mixer,  it  also  differs  in  some  important  ones.  We  visit  the  difference  and  show 
that  even  though  structurally  the  sampling  mixer  is  similar  to  SAH  and  some  design 
Fig. A.1    Single-ended sample and hold (SAH) and sampling operation


criteria are common, there are other design criteria that are markedly different. To simplify comparisons, all the design formulas are based on the assumption that $V_{lo}$ (or the sampling clock $V_s$ for an SAH) is not switching.

To start, let us show the operation of a single-ended SAH, as depicted in Figure A.1. In an SAH a sampling clock, $V_s$, with frequency $f_s$, is applied to the gate. Let us review some applications for such an SAH, particularly for high-speed operation. In high-speed operation the input consists of a wide baseband signal (frequency $f_{in}$) rather than a narrow baseband signal superimposed on a high-frequency carrier, as in the sampling mixer case. To satisfy the Nyquist criterion, $f_s$ is around 2 times the highest frequency of this wide baseband signal. Hence within one sampling clock period, $T_s$ ($= 1/f_s$), $V_{in}$ does not vary too much. This is shown in Figure A.1.

As with the sampling mixer, because of $R_{on}$ of $M_1$ and $C_{sample}$ (we are neglecting $C_{db}$ of $M_1$), an SAH has a finite bandwidth and introduces tracking error during the track mode. Looking in the time domain, this finite bandwidth poses an RC time constant constraint on the acquisition time $t_{acq}$, defined as the time needed for $V_{out}$ to settle to $V_{in}$ (the degree of settling is, of course, dictated by the required resolution). This is of primary importance in an SAH because here we are interested in the absolute accuracy of the sampled value. The requirement can be very high (on the order of 13 to 14 bits), which means that the sampled error must be small. The $t_{acq}$ ($= R_{on}C_{sample}$) has to be smaller than $T_s/2$. Usually $T_s$ is rather small and hence $t_{acq}$ must be made small, making $t_{acq}$ a primary design constraint. Accordingly, to satisfy the sampled error requirement, both $R_{on}$ and $C_{sample}$ should be small.
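To give a feel for how tight this constraint is, the sketch below estimates the largest allowable $R_{on}C_{sample}$ for a given resolution. It rests on assumptions not made in the text: single-pole exponential settling, a residual error of $e^{-t/\tau}$ that must fall below $2^{-N}$ of the step, and half the clock period available for tracking. The sampling rate and function name are hypothetical.

```python
import math

def max_time_constant(f_s, n_bits):
    """Largest RC that settles to n_bits within the track phase T_s/2.

    Assumption: exp(-t/tau) <= 2**-n_bits, i.e. t >= n_bits*ln(2)*tau.
    """
    t_track = 0.5 / f_s                 # half the sampling period, s
    n_tau = n_bits * math.log(2.0)      # required number of time constants
    return n_tau, t_track / n_tau

F_S = 100e6  # hypothetical sampling rate, Hz
for bits in (8, 13, 14):
    n_tau, tau_max = max_time_constant(F_S, bits)
    print(f"{bits:2d} bits: {n_tau:.2f} time constants, tau <= {tau_max * 1e12:.0f} ps")
```

Under these assumptions a 13 to 14 bit requirement needs roughly nine to ten time constants inside $T_s/2$, which is why both $R_{on}$ and $C_{sample}$ are pushed to small values.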

For a MOS transistor biased in the triode region, $R_{on}$ is given by (5.12), which is repeated here:

$$R_{on} = \frac{1}{k(V_{GS} - V_t)} = \frac{1}{\mu C_{ox}\left(\frac{W}{L}\right)(V_{GS} - V_t)} \tag{A.1a}$$




Substituting (A.1a) into $t_{acq} = R_{on}C_{sample}$, we can calculate the sampled_error as


/        TJ2    \ 
samplecLerror  —  1  —  exp ^ (A.  lb) 


\    KVgs-V,)/ 

Hence a small sampled error means a large k and a small $C_{sample}$ for a given overdrive voltage $V_{GS} - V_t$.

Next we should remember that so far we have approximated $R_{on}$ [as given in (A.1a)] as a linear resistor. What happens when we bring back the nonlinearity? We observe that during the track mode the nonlinear $R_{on}$ introduces distortion in exactly the same way as in sub-section 5.9.1. For an SAH, typically we do not have an interference signal at the input. On the other hand, preserving the signal waveform is important. Hence low harmonic distortion is of more importance than low intermodulation distortion. Furthermore, if we assume that we have a differential SAH, $HD_2$ is small. Hence the harmonic distortion of importance is $HD_3$. The $HD_3$ of an SAH is the same as the $HD_3$ of a sampling mixer derived with $V_{lo}$ not switching. Hence it is given by (5.55). $A_{rf}$ becomes $A_{in}$, C becomes $C_{sample}$, and $\omega_{rf}$ becomes $\omega_{in}$. Equation (5.55), with these modifications, is now written as:


$$HD_3 = \frac{A_{in}^2}{4}\,\frac{\omega_{in}C_{sample}}{k(V_{GS}-V_t)^3} \qquad \text{(A.2)}$$


We conclude that a small $HD_3$ means a large $k$ and a small $C_{sample}$ as well.

Finally, the sampled noise in an SAH, $\sigma_{n\_s}^2$, is the same as the variance of the noise of a sampling mixer with $V_{lo}$ not switching, $\sigma_n^2$. This is given in (5.113), where $C_{load}$ becomes $C_{sample}$ in the present case:

$$\sigma_{n\_s}^2 = kT/C_{sample} \qquad \text{(A.3)}$$


5.13     Observations 

Comparing SAHs and sampling mixers, we make the following observations:

a.  Structurally  they  are  the  same,  both  consisting  of  a  MOS  transistor  followed  by  a 
capacitor. 

b. Signal conditions: There are differences and similarities. One major difference is that for an SAH (as shown in Figure A.1) we have a wide baseband signal (frequency $f_{in}$) sampled by a clock (frequency $f_s$), whereas for a sampling mixer (as shown in Figure 5.10), we have a narrow baseband signal (frequency $f_{in\_narrow}$) superimposed on a fast carrier (frequency $f_{rf}$), mixed down by an LO signal (frequency $f_{lo}$). What are the frequencies of interest? In an SAH they are $f_{in}$ and $f_s$. In a sampling mixer they are $f_{rf}$ (not $f_{in\_narrow}$) and $f_{lo}$. Now what are their relationships? Figure A.2 displays the relationship. For an SAH, $f_{in} < f_s/2$ (to satisfy the Nyquist criterion). This is shown in Figure A.2(a). In a sampling mixer $f_{rf} > f_{lo}$ (for downconversion mixing) but typically they are pretty close. This is shown in Figure A.2(b). So far, we have talked about the frequency relationship within an SAH and within a sampling mixer. How about the frequency relationship between the two? To make a fair comparison, we assume that the SAH is used for a high-frequency application such that its $f_{in}$ is close to $f_{rf}$ of the sampling mixer. This is evident when comparing $f_{in}$ in Figure A.2(a) to $f_{rf}$ in Figure A.2(b). Figure A.2(c) and (d) are just comparisons of $T_s$ to $T_{lo}$.

Fig. A.2 Relationship between $f_{in}$, $f_s$ of a sample and hold and $f_{in\_narrow}$, $f_{lo}$, $f_{rf}$ of a sampling mixer

c. Design requirements: There are differences and similarities. For an SAH the designer is primarily concerned about sampled error, whereas for a sampling mixer the designer is primarily concerned about distortion. They both have about the same requirement on noise.


5.13.1     Design  Methodology 

To save notation, sampled_error, $HD_3$, $\sigma_{n\_s}^2$, $G_c$, $IM_3$, and $\sigma_n^2$ are also used to denote the required sampled_error, $HD_3$, $\sigma_{n\_s}^2$, $G_c$, $IM_3$, and $\sigma_n^2$.

We start our discussion on the design methodology by stating typical design specifications for both the SAH and the sampling mixer.

Typical design specifications (SAH): sampled_error, $HD_3$, $\sigma_{n\_s}^2$

Now we want to develop relevant design equations for the SAH. Applying (A.1b), (A.2), and (A.3) to satisfy these specifications means following the design equations:

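$$\exp\left(-\frac{T_s/2}{C_{sample}/[k(V_{GS}-V_t)]}\right) < \text{sampled\_error} \qquad \text{(A.4)}$$

$$\frac{A_{in}^2\,2\pi f_{in}C_{sample}}{4k(V_{GS}-V_t)^3} < HD_3 \qquad \text{(A.5)}$$

$$kT/C_{sample} < \sigma_{n\_s}^2 \qquad \text{(A.6)}$$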

Next  let  us  turn  to  the  sampling  mixer. 

Typical design specifications (sampling mixer): $G_c$, $IM_3$, $\sigma_n^2$

Again we want to develop relevant design equations for the sampling mixer. For simplification let us focus on the $V_{lo}$ not switching case.

Applying (5.43), (5.57), and (5.113) to satisfy these specifications means following the design equations:


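$$\left|\frac{\dfrac{1}{j\omega_{rf}C_{load}}}{\dfrac{1}{j\omega_{rf}C_{load}} + \dfrac{1}{k(V_{GS}-V_t)}}\right| > G_c \qquad \text{(A.7)}$$

$$\frac{A_{interference}^2\,\omega_{rf}C_{load}}{4k(V_{GS}-V_t)^3} < IM_3 \qquad \text{(A.8)}$$

$$kT/C_{load} < \sigma_n^2 \qquad \text{(A.9)}$$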

Next, we want to look at the design parameters. We can identify the relevant design parameters for both circuits as: $C_{sample}$, $C_{load}$, $k$, $V_{GS}-V_t$. We then classify these design parameters according to whether they are common or distinct to designing the two circuits.

Design parameters that are designed in a common fashion for both circuits: $C_{sample}$, $C_{load}$, $V_{GS}-V_t$.

a. $V_{dd}$ is the same for both SAH and sampling mixers. This means $V_{GS}-V_t$, fixed by $V_{dd}$, should be the same for both circuits.

b. Noise requirements are about the same for both SAH and sampling mixers. This means that $C_{sample}$ and $C_{load}$ are found by similar equations: (A.6), (A.9).

Design parameter that is designed using different approaches for the two circuits: $k$

We next focus on how this $k$ is designed in an SAH and a sampling mixer.

For an SAH, $k$ is designed via the following steps:

Step 1: We start from (A.4). From application requirements, the sampled error usually is very small. Again from application requirements, as shown in Figure A.2(c), $T_s$ is very small. This implies that it is highly difficult to find a $k$ that satisfies (A.4).

Step 2: Let us go to (A.5). From application requirements, $HD_3$ is typically moderate in difficulty to satisfy. Again, from application requirements, as shown in Figure A.2(a), $f_{in}$ is high. This implies that it is moderately difficult to find a $k$ that satisfies (A.5).

In conclusion, the strategy that we adopt is first to design $k$ to satisfy (A.4), then (A.5) will typically be satisfied.
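
The two-step strategy for the SAH can be sketched numerically. The Python below is a minimal illustration, assuming the exponential-settling form of (A.4), the $HD_3$ expression of (A.5), and the $kT/C$ noise of (A.6). The function names and every numerical specification (13-bit settling to half an LSB, a 100 MHz clock, a 40 MHz input, an $HD_3$ of 1%, a 0.5 V amplitude, a 1.45 V overdrive, and the noise variance) are assumptions chosen only for illustration.

```python
import math

BOLTZMANN = 1.380649e-23  # J/K

def c_sample_min(noise_var, temp_kelvin=300.0):
    """(A.6): smallest C_sample with kT/C_sample <= noise_var."""
    return BOLTZMANN * temp_kelvin / noise_var

def k_from_settling(err, c_sample, v_ov, t_s):
    """(A.4): smallest k with exp(-(T_s/2)*k*v_ov/C_sample) <= err."""
    return c_sample * math.log(1.0 / err) / (v_ov * t_s / 2.0)

def k_from_hd3(hd3, a_in, f_in, c_sample, v_ov):
    """(A.5): smallest k with A_in^2*2*pi*f_in*C_sample/(4*k*v_ov^3) <= hd3."""
    return a_in ** 2 * 2.0 * math.pi * f_in * c_sample / (4.0 * hd3 * v_ov ** 3)

# Assumed example specifications (not taken from the text)
v_ov = 1.45                                   # V_GS - V_t in volts
c_s = c_sample_min(4.2e-9)                    # noise variance spec in V^2
k_settle = k_from_settling(2.0 ** -14, c_s, v_ov, 10e-9)
k_hd3 = k_from_hd3(1e-2, 0.5, 40e6, c_s, v_ov)
k_design = max(k_settle, k_hd3)               # the tighter constraint sets k
```

With these assumed numbers the settling constraint gives the larger $k$, which is the situation described in the steps above; other specifications could reverse the ordering.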

For  a  sampling  mixer,  k  is  designed  via  the  following  steps: 

Step 1: We start from (A.7). From application requirements $G_c$ is typically quite easy to satisfy. Again, from application requirements, as shown in Figure A.2(b), $\omega_{rf} = 2\pi f_{rf}$ is high. This implies that it is not that difficult to find a $k$ that satisfies (A.7).

Step 2: Let us go to (A.8). From application requirements, $IM_3$ is usually moderate in difficulty to satisfy. Again, from application requirements, as shown in Figure A.2(b), $\omega_{rf} = 2\pi f_{rf}$ is high. This implies that it is moderately difficult to find a $k$ that satisfies (A.8).

In conclusion, the strategy that we adopt is first to design $k$ to satisfy (A.8) and then (A.7) will typically be satisfied.
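
Step 1 for the sampling mixer can be illustrated with a quick calculation. The sketch below is an illustration under assumptions rather than the book's procedure: it evaluates the magnitude of the divider formed by the load capacitor and $R_{on} = 1/[k(V_{GS}-V_t)]$, which is the quantity compared with $G_c$ in (A.7). The function name is hypothetical; the values of $k$, overdrive, and $C_{load}$ are borrowed from the parameter list of Problem 5.5 and the 1.8 GHz RF frequency from Problem 5.7, purely as example numbers.

```python
import math

def conversion_gain_not_switching(k, v_ov, f_rf, c_load):
    """|Z_C / (Z_C + R_on)| with Z_C = 1/(j*w*C_load) and R_on = 1/(k*v_ov)."""
    w_rf = 2.0 * math.pi * f_rf
    z_c = 1.0 / (1j * w_rf * c_load)
    r_on = 1.0 / (k * v_ov)
    return abs(z_c / (z_c + r_on))

# Assumed example values
g_c = conversion_gain_not_switching(k=0.0137, v_ov=1.45, f_rf=1.8e9, c_load=0.352e-12)
g_c_db = 20.0 * math.log10(g_c)
```

For these values the gain is within a fraction of a decibel of unity, which is consistent with the remark that (A.7) is usually easy to meet.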

We summarize the key observation in our present design methodology: as opposed to traditional sampled circuits such as an SAH, whose design focuses on $t_{acq}$, for a sampling mixer the distortion requirement is more important in guiding the design process. This justifies why in this chapter we spend a substantial amount of time looking into this distortion mechanism, including the time-varying distortion and sampling distortion due to finite fall time.

5.1 In Numerical example 5.1 we have not checked if the model for high frequency is applicable. Use the same values as in the example to check if the assumptions (5.13) and (5.14) are valid for the high-frequency case.

5.2 Calculate the conversion gain in the general $V_{lo}$ switching case of a switching mixer for DECT application using (5.29). Use parameters given in Numerical example 5.1. The W/L ratio is assumed to be the value derived in Numerical example 5.1, or 762 μm/1 μm. Assume that $C_{ox}$ = 2 fF/μm².

5.3 Compare the differences between the following switching mixers: mixer in Figure 5.1 and mixer in Figure 5.8. Draw the $V_{if}$ waveform. (When you draw the $V_{if}$ waveform, assume that the $V_{rf}$ and $V_{lo}$ waveforms are as given in Figure 5.10). Next repeat the comparisons between the switching mixer in Figure 5.8 and the sampling mixer in Figure 5.10. What should be the relative values of $C_{ext}$ and $C_{load}$? Which of these two mixers has a poorer $IM_3$?

5.4 In Figure 5.13 derive the relationship between $k_2$ of switch $M_2$ and $k_1$ of switch $M_1$ to obtain equal second harmonic distortion at high frequency (assume that continuous time distortion dominates). In this case, does increasing the sampling mixer switch $M_1$'s size always enhance the performance? At high frequency is the use of an opamp to provide virtual ground a good way of reducing the size of $M_2$?

5.5  The  distortion  formulas  for  a  switching  mixer  (as  shown  in  Figure  5.1  and 
Figure  5.8)  or  a  sampling  mixer  (as  shown  in  Figure  5.10),  which  include 
second-order  effects  such  as  threshold  modulation,  velocity  saturation,  and 
mobility  degradation,  have  been  derived  at  low  frequency  and  are  given  by 
the  following,  where  IM3'  is  the  modified  IM3: 

a.  Threshold  Modulation 

$IM_3' = (1 + \theta_1)^2 \times IM_3$

where  0i  =  — ,  y 

where $\gamma$ = body effect parameter
$\phi_f$ = 2 times the bulk potential
$V_s$ = source voltage

b.  Velocity  Saturation 

1M'3  =  I  1  +  aL  j  f '-  R )  IM3  where  a  =    „2  ^f2Ry     and  j_Lr  =  a  coefficient  that 

helps model the absolute function of the velocity $|V|$ as a square function (i.e., $|V| \approx \mu_f V^2$), $R$ = impedance of load (resistance if load is a resistor, impedance if load is a capacitor), $g_{ds} = 1/R_{on}$, and $v_{sat}$ = saturation velocity.

c.  Mobility  Modulation  from  Normal  Field 

$IM_3' \approx [1 - \theta(V_{GS} - V_T)]\,IM_3$, where $\theta$ is the parameter that relates the effective mobility due to the normal field, $\mu_{eff}$, to the normal mobility $\mu$ as follows: $\mu_{eff} = \dfrac{\mu}{1 + \theta(V_{GS} - V_T)}$.
Next assume the following parameters:

$V_{GS} - V_T$ = 1.45 V
$C_{load}$ = 0.352 pF, $W/L$ = 100, or $k = k'(W/L)$ = 0.0137 A/V²
$\gamma$ = 0.56, $\phi_f$ = 0.393 V, $V_{SB}$ = 2.5 V
$\mu_f$ = 0.5, $C_{ox}$ = 2 fF/μm², $v_{sat}$ = 2 × 10⁵ m/s
$\theta$ = 0.05/V

The rest of the parameters are given in Numerical example 5.2.
Define the factors $F_{vel}$, $F_{BodyEff}$, $F_{Normal}$ as the ratios of $IM_3$ (due to second-order effects)/$IM_3$ (no second-order effects). Calculate these factors.

5.6 Redo the $IM_3$ calculation part in Numerical example 5.2, except this time include the second-order effects (i.e., the three short channel effects due to threshold modulation, velocity saturation, and mobility modulation as discussed in Problem 5.5 [and use the same formulas]).

5.7 This problem recalculates the NF of the sampling mixer discussed in Case 2, Numerical example 5.2 by using a more sophisticated noise model. First, extrinsic noise from the LO port is included. Also, the intrinsic noise is to be calculated for the general $V_{lo}$ switching case. To perform the calculations, we make the following assumptions. First, let us assume that all the parameters from Numerical example 5.2 carry over. Second, we assume the following parameters for the sampling mixer: $V_{rf0}$ = 0, $A_{lo}$ = 1 V, $V_G - V_t$ = 1.45 V, $V_G$ = 2.45 V. Third, let us assume that the conversion gains, $H_n$, are the same for both zero and finite fall time. Furthermore, as a simplification, we assume $H_n$, $n$ = 1, 2, ... are all identical and are given by the following formula, which is modified from the $G_c$ formula as given by (5.45). This modified formula incorporates the $R_s$ and is given as


Gc((00ut,COl0) 


Gco(a>oul) 


'2     CO,,,     /  -^ML 

77^7"    exP   2<% 


Gco(a>out) 


oj,„„Ci„ 


1 

COoutCload 


R,  +  R„ 


(a) Calculate the NF of this 1.8 GHz sampling mixer including $\sigma_{n\_lo\_output\_s}^2$ and using intrinsic noise calculated for the general $V_{lo}$ switching case. Do the calculation for two cases. For Case 1, assume $\sigma_{n\_lo\_input\_s}^2$ in (5.120) comes from a 1 kΩ resistor (you can imagine that the LO port is terminated with a 1 kΩ resistor). For Case 2, assume that $\sigma_{n\_lo\_input\_s}^2$ comes from a 50 Ω resistor.

(b)  Repeat  (a)  except  this  time  include  ayo^utp^j,;  .  Again,  use  intrinsic 
noise calculated for the general $V_{lo}$ switching case.

Chapter 6

Analog-to-Digital  Converters 


6.1     Introduction 

Traditionally in a receiver (as shown in Figure 2.1), upon mixing the input signal from RF to IF, subsequent demodulation can be performed in a couple of ways, depending on the kind of modulation used. In the case of DECT, since the input signal is phase modulated (GMSK for DECT), MSK demodulation for the digitally encoded phase information should be performed. This can be done in a manner akin to demodulating a QPSK signal. There are three common ways to demodulate a QPSK signal: FM discriminator, IF detection, and baseband detection [9]. Each of these methods can be done entirely in the analog domain or by first doing an analog to digital (A/D) conversion at IF and then implementing these methods digitally using digital signal processing (DSP). In this sense the A/D converter becomes part of the demodulator. Using an A/D converter in a demodulator is beneficial because the post A/D conversion signal processing function can then be integrated on chip easily.

Instead of performing A/D conversion right before demodulation, we can perform this conversion earlier on in the receiver front end. Referring again to Figure 2.2, an A/D converter can be placed inside the front end and used to digitize the signal at an early stage of the front end. For example, if A/D conversion is performed right after the LNA, then BPF2, mixer, BPF3, and the IF amplifier can all be implemented digitally using DSP. In this sense the A/D converter becomes part of the front end and is responsible for processing (conditioning) the received signal/AWGN/interference before admitting it to the demodulator. The resulting front end architecture is shown in Figure P.2(a), Chapter 2. Using an A/D converter in the front end has the same benefit as having the converter in a demodulator. In addition, since it allows the post A/D conversion signal processing function to be done digitally, it allows more flexibility in implementing the receiver front end.

In this chapter we first review the common methods of demodulation. We then discuss A/D converters most suitable to be used in the demodulator as well as in the front end. Two such A/D converters, the low-pass and bandpass sigma-delta modulators, are investigated in detail. In both cases, design procedures are developed and design examples for using them in a DECT receiver are presented.


6.2     Demodulators 


In this section we review the three common methods of demodulation.


6.2.1     FM  Discriminator  (Incoherent) 

Let  us  now  study  the  first  method,  FM  discriminator  [9],  which  is  illustrated  in 
Figure  6.1. 

This demodulation is done incoherently at passband. The FM discriminator has a phase-shift network that introduces a delay of $t_0$ to the carrier and a delay of $t_1$ to the phase. The input to this network is a filtered (by the BPF) and limited (by the limiter) version of the original QPSK modulated IF signal $S_{QPSK}(t)$. We can represent this limited and filtered version by an equivalent FM signal [9], $S_{FM}(t)$, where $S_{FM}(t) = \cos[\omega_c t + \phi(t)]$. Here $\omega_c$ is the carrier frequency (equal to $\omega_{if}$). The phase of $S_{FM}(t)$, $\phi(t)$, is defined as $\phi(t) = 2\pi f_\Delta \int_0^t x_M(\lambda)\,d\lambda$, where $x_M(t)$ is the modulating signal and $f_\Delta$ is the modulation index. Conversely we have $\dot{\phi}(t) = 2\pi f_\Delta x_M(t)$. Upon passing $S_{FM}(t)$ through the phase-shift network, the output will be $\cos[\omega_c t - \omega_c t_0 + \phi(t - t_1)]$. Now if we design $t_0$ such that $\omega_c t_0 = 90°$, then this output becomes $\sin[\omega_c t + \phi(t - t_1)]$. As shown in Figure 6.1, this is multiplied by the original $S_{FM}(t)$, which equals $\cos[\omega_c t + \phi(t)]$. Upon being filtered by the LPF, the output is, to within a constant scale factor, $y_D(t) = \sin[\phi(t) - \phi(t - t_1)] \approx \phi(t) - \phi(t - t_1)$. The last approximation is valid if we assume $t_1$ is small enough that $|\phi(t) - \phi(t - t_1)| \ll \pi$. Under this assumption we also have $\dot{\phi}(t) = [\phi(t) - \phi(t - t_1)]/t_1$. Hence $[\phi(t) - \phi(t - t_1)] = \dot{\phi}(t)\,t_1 = 2\pi f_\Delta t_1 x_M(t)$. Therefore, $y_D(t) = 2\pi f_\Delta t_1 x_M(t)$ and we extract the original FM modulating signal $x_M(t)$ and generate the demodulated FM signal $y_D(t)$ out of it.

Fig. 6.1 FM discriminator, analog

Fig. 6.2 FM discriminator, digital

Now we can sample this demodulated FM signal and obtain $y_D(n)$, where $n$ is the time index. This $y_D(n)$, which represents the phase shift, can be mapped to the proper binary bits to obtain the demodulated QPSK signal, $X_{BB}(n)$. Let us now represent the demodulated signal $X_{BB}(n)$ as $(x_{MI}(n)\ x_{MQ}(n))$, where $x_{MI}(n)$ and $x_{MQ}(n)$ are the sampled I, Q branches of the message $x_M(t)$. Hence we can map $y_D(n)$ to $(x_{MI}(n)\ x_{MQ}(n))$. For QPSK, $(x_{MI}(n)\ x_{MQ}(n)) = (1\ 1)$ if $y_D(n) = \pi/4$, $(0\ 1)$ if $y_D(n) = 3\pi/4$, $(0\ 0)$ if $y_D(n) = -3\pi/4$, and $(1\ 0)$ if $y_D(n) = -\pi/4$.
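
The derivation above can be checked numerically. The following Python sketch is an illustration only: it models the phase-shift network as an ideal 90° carrier shift plus a delay $t_1$ on the phase, replaces the LPF with an average over a whole number of double-frequency cycles, and holds the message $x_M$ constant over that window. The function names, the 10 MHz carrier, the 100 kHz $f_\Delta$, and the 100 ns $t_1$ are assumed values, not taken from the text. The product-and-filter step actually yields $-\tfrac{1}{2}\sin[\phi(t) - \phi(t - t_1)]$, so the sketch rescales by $-2$ to match $y_D$ as written above. A second helper applies the QPSK phase-to-bits mapping by picking the nearest of the four nominal phases.

```python
import math

def discriminator_output(x0, f_c=10e6, f_delta=100e3, t1=100e-9,
                         cycles=200, n=10000):
    """Delay-and-multiply FM discriminator for a constant message value x0.

    All default parameter values are illustrative assumptions.
    """
    w_c = 2.0 * math.pi * f_c

    def phi(t):
        # phi(t) = 2*pi*f_delta * integral of x_M = 2*pi*f_delta*x0*t for constant x_M
        return 2.0 * math.pi * f_delta * x0 * t

    # Average over an integer number of cycles of the double-frequency term;
    # this stands in for an ideal low-pass filter.
    f_double = 2.0 * (f_c + f_delta * x0)
    window = cycles / f_double
    acc = 0.0
    for i in range(n):
        t = i * window / n
        s_fm = math.cos(w_c * t + phi(t))
        shifted = math.sin(w_c * t + phi(t - t1))  # 90 degree carrier shift, t1 phase delay
        acc += s_fm * shifted
    # The averaged product equals -0.5*sin(phi(t) - phi(t - t1)); undo that factor.
    return -2.0 * acc / n

# Nominal y_D(n) in radians -> (x_MI, x_MQ), as listed in the text
QPSK_MAP = {
    math.pi / 4: (1, 1),
    3 * math.pi / 4: (0, 1),
    -3 * math.pi / 4: (0, 0),
    -math.pi / 4: (1, 0),
}

def map_phase_to_bits(y_d):
    """Return the bit pair whose nominal phase is closest to y_d (radians)."""
    def wrapped_distance(p):
        return abs(math.atan2(math.sin(y_d - p), math.cos(y_d - p)))
    return QPSK_MAP[min(QPSK_MAP, key=wrapped_distance)]

y = discriminator_output(0.5)
expected = 2.0 * math.pi * 100e3 * 100e-9 * 0.5  # 2*pi*f_delta*t1*x_M
```

Comparing `y` with `expected` shows how closely the small-angle result $2\pi f_\Delta t_1 x_M$ is reproduced for the assumed values.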

This FM discriminator is usually implemented in the analog domain. As an example, the IF receiver LMX2240, made by National Semiconductor, is one of the commercial realizations using this approach. This chip consists of a hard limiter at the input with an input impedance of 150 Ω and a −80 dB sensitivity for the IF signal. A Gilbert quad mixer (similar to the one covered in Chapter 4) is used as the multiplier in Figure 6.1. To get the phase-shifted signal, an external tank circuit with a steep phase response is required, whose bandwidth is approximately 1% of the IF frequency. As a result, this approach suffers from (a) the need to have external capacitors and inductors for its tank circuit, and (b) the need to have a narrowband analog filter. These analog components are expensive and are not easily integrated on a chip.

To circumvent these problems, the approach shown in Figure 6.2 can be used. In Figure 6.2 the IF signal $S_{QPSK}(t)$ is first digitized by performing an A/D conversion at the passband using a passband analog to digital converter (ADC). This generates a digital version, $S_{QPSK}(n)$. A digital version of the FM discriminator described in Figure 6.1 is then used to convert this $S_{QPSK}(n)$ to $X_{BB}$. This approach pushes more of the processing into the digital domain and is desirable.


6.2.2    IF  Detection  (Coherent) 


Demodulation can also be done coherently. If performed at IF it is normally done using a matched filter [9]. As before, the demodulation can be done in both the analog and digital domains. The desirable approach, the digital-based method, consists of converting $S_{QPSK}(t)$ to $S_{QPSK}(n)$ using a passband ADC, as shown in Figure 6.3. The multiplication and low-pass filter functions needed in the IF detection will then be done digitally to generate $X_{BB}$.

6.2.3 Baseband Detection (Coherent)


If coherent detection is performed at baseband, it is typically done using a matched filter [9]. As with coherent detection at IF, it can be done in both the analog and digital domains, where again the digital approach is preferable. This digital approach is shown in Figure 6.4, where the I, Q branches are depicted. Notice the similarity of Figure 6.4 to Figure 1.7, except that the ADCs are shown here. Referring to Figure 6.4, the incoming IF signal at $\omega_c = \omega_{if}$ is mixed by the LO signal at $\omega_{lo}$. This $\omega_{lo}$ frequency is picked to be at the same frequency as $\omega_c$. After analog mixing, a low-pass filter is used to eliminate the frequency component at $\omega_{if} + \omega_{lo}$, leaving behind only the baseband signal. Notice that this mixing function also performs the multiplying function necessary for detection. A baseband A/D converter is then used to convert the analog signal to its digital form, and the baseband digital signal processor performs the rest of baseband detection in the digital domain, at baseband. After the detection, equalization/decoding can also be done. Equalization, which has already been discussed in Chapter 1, combats ISI resulting from fading. Decoding can be further used to reduce noise.

Fig. 6.5 Baseband detection, hybrid


Returning to the A/D converter and comparing the approaches in Figure 6.2 through Figure 6.4, the major difference is the requirement on the ADCs. The requirement is very high in Figures 6.2 and 6.3 since the A/D converter has to digitize directly at IF. This makes the design of the A/D converter a big challenge, especially at high IF. Therefore, usually Figure 6.4 is reserved for high IF (up to 400 MHz) and Figures 6.2 and 6.3 for low IF architectures (10 MHz). This is particularly true if we observe that the performance level (especially distortion) of analog mixers at 100 MHz can be substantially better than that of A/D converters in the same frequency range. Recently an improvement on the baseband detection, digital approach, denoted as the hybrid approach [7], was proposed and is shown in Figure 6.5. Comparing Figures 6.4 and 6.5, everything is the same except that the mixer and baseband ADC are replaced by a feedback structure. This feedback structure does A/D conversion as usual, but then takes its output, performs D/A conversion, and subtracts the IF signal from this analog signal. In spite of its seemingly complicated structure, in practical implementation a lot of simplifications can be performed, resulting in a markedly simplified final form. The important point is that because the components, most notably the analog mixers, are embedded in a feedback loop, their nonidealities are suppressed by the feedback action and thus the approach offers a significant advantage. This architecture is beyond the scope of this chapter and is not covered. Interested readers are referred to reference [7].

6.3     A/D Converters Used in a Receiver

Most of the discussion in this chapter is focused on both baseband and passband ADCs. In digital radio applications [1], the signal to be digitized typically can have a center frequency that varies from a few megahertz to a few hundred megahertz, with bandwidth varying from a few kilohertz to tens of megahertz. The A/D converter used in this application needs to handle signals at these frequencies and with these bandwidths while maintaining a resolution in the 70 to 80 dB (12 to 13 bit) range. First, we look at the different A/D converter architectures and comment on which one is promising for wireless communication.

6.3.1     Wideband Versus Narrowband A/D Converters

There are two general ways of doing A/D conversion for wireless communication: wideband and narrowband conversion [9]. Oversampled A/D converters such as sigma-delta (ΣΔ) converters fall in the narrowband category. Wideband converters include architectures such as pipelined and flash converters.

Figure 6.6 summarizes the sampling frequency and resolution of different A/D converters. In the present ADC application, the concept of resolution is identical to dynamic range. Hence an ADC with an 80 dB resolution sweeps through a dynamic range of 80 dB while maintaining an acceptable output SNR. Resolution can also be specified in number of bits. In addition, the concept of sampling frequency ($f_s$) needs to be clarified. Sampling frequency normally is related to the bandwidth of the input signal an ADC can digitize. Typically the sampling frequency is equal to two times this bandwidth, as governed by the Nyquist sampling theorem. However, in an oversampled ADC that performs narrowband A/D conversion, because of oversampling, the real sampling frequency is much higher than two times the bandwidth. To clarify, in Figure 6.6 the sampling frequency quoted for the oversampled ADC is actually just two times the input signal bandwidth and is different from its real sampling frequency. For the rest of the ADCs, the sampling frequency quoted in Figure 6.6 is the real sampling frequency.
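
The link between resolution in bits and in decibels, and between the quoted and the real sampling frequency of an oversampled ADC, can be made concrete. The sketch below assumes the standard ideal-quantizer relation SNR ≈ 6.02N + 1.76 dB for a full-scale sine wave, which the text does not state explicitly. The function names and the example bandwidth and clock frequency are illustrative assumptions.

```python
def bits_to_db(bits):
    """Ideal N-bit quantizer SNR in dB for a full-scale sine (assumed model)."""
    return 6.02 * bits + 1.76

def db_to_bits(db):
    """Inverse of bits_to_db: equivalent number of bits for a resolution in dB."""
    return (db - 1.76) / 6.02

def quoted_sampling_frequency(bandwidth_hz):
    """Rate quoted for an oversampled ADC in the text: two times the bandwidth."""
    return 2.0 * bandwidth_hz

def oversampling_ratio(real_fs_hz, bandwidth_hz):
    """Ratio of the real sampling frequency to the quoted (Nyquist) one."""
    return real_fs_hz / quoted_sampling_frequency(bandwidth_hz)

# Assumed example: 1 MHz signal bandwidth digitized with a 64 MHz clock
osr = oversampling_ratio(64e6, 1e6)
```

Under this model 12 and 13 bits correspond to about 74 dB and 80 dB, which brackets the upper part of the 70 to 80 dB range quoted earlier.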

In  wireless  communication,  there  is  also  a  stringent  power  requirement  for 
A/D  converters.  Figure  6.7  compares  power  consumption  in  various  wideband 
A/D  converters.  In  Figure  6.6,  we  can  see  that  flash  A/D  converters  have  the  highest 
sampling  rate  (input  signal  bandwidth)  but  lowest  resolution.  The  oversampled 
A/D  converters  have  the  highest  resolution  but  the  lowest  input  signal  bandwidth. 
Between  them  there  are  pipelined  A/D  converters,  which  offer  a  trade-off  between 
bandwidth  and  resolution.  As  shown  in  Figure  6.7,  the  flash  A/D  converters  have 
the  highest  power  consumption.  Because  there  are  very  few  oversampled 
and  pipelined  A/D  converters  operating  at  an  8-bit  resolution,  they  are  not  presented 
in  this  figure.  In  general,  oversampled  A/D  converters  have  the  same  order  of  power 
consumption  as  the  algorithmic  ones  and  pipelined  A/D  converters  have  power 
consumption similar to subranging A/D converters.

Fig. 6.6 Resolution versus sampling frequency for A/D converters implemented using different architectures


Typically, for wide signal bandwidth applications (such as CDMA, or wireless video), Nyquist rate ADCs are used. One such ADC using a pipelined architecture has been developed [8]. On the other hand, for narrow signal bandwidth applications (such as AMPS, GSM) oversampled ADCs are preferable. As mentioned previously, since power is one of the most important issues in wireless application, architectures that have high power dissipation are avoided. With this in mind we make the following observations: for wide signal bandwidth applications, the pipelined converter is appealing because its power dissipation is the lowest among high-speed Nyquist rate A/D converters. For narrow signal bandwidth applications, oversampled A/D converters (of which the sigma-delta modulator is an example) are the prime choice. In particular, low-pass and bandpass sigma-delta modulators are usually adopted in high and low IF A/D conversions, respectively. In this chapter, in order to narrow the scope we focus on narrowband applications and hence we concentrate only on sigma-delta modulators. In the following sections, the important design parameters for each architecture are explained and a design procedure is presented to help guide readers in obtaining the required design parameters. This is followed by a detailed design example.

Fig. 6.7 Power versus sampling frequency for an 8-bit A/D converter implemented in different architectures

6.3.2    Narrowband A/D Converters: General Description


We can understand the principle of sigma-delta modulators by starting off with a flash A/D converter. A flash ADC generates quantization noise in the process of performing an analog to digital conversion. A low-pass (bandpass) sigma-delta modulator uses a low-pass (bandpass) filter (henceforth called the loop filter) in front of such a flash type ADC (henceforth called an internal quantizer) to process the quantization noise generated by this internal quantizer. This is achieved by feeding back the internal quantizer output to the input of the loop filter. By enclosing this filter/quantizer combination inside a feedback loop, we shape the quantization noise in a frequency selective way, resulting in the noise being suppressed at a frequency band of our choice (DC for low-pass and IF for bandpass ΣΔ modulators). For example, the low-pass ΣΔ modulator can be used to implement the baseband ADC in Figure 6.4 and in Figure P.2(a), Chapter 2. Similarly, the bandpass ΣΔ modulator can be used to implement the passband ADC in Figure 6.2 and Figure 6.3. Meanwhile, since the feedback loop broadbands the frequency response of the loop filter, the input signal is not subjected to any frequency shaping and passes unattenuated to the output.

All sigma-delta modulators (low-pass and bandpass) can be broadly categorized according to the number of bits in the internal quantizer and the order of the loop filter, which dictates the order of the modulator. In addition, each of these individual modulators can be cascaded to form a multistage modulator. We review some of the basics of sigma-delta modulators, starting from a single-stage first-order modulator. Subsequently, we go through the second-order and then higher-order modulators and their variations. We focus on architectures that are of most relevance to wireless communication.

6.4    Low-Pass  Sigma-Delta  Modulators 

An example of using a low-pass sigma-delta modulator can be found in Figure 6.4
and in Figure P.2(a), Chapter 2. Here, because the baseband ADC operates on a
baseband signal, a low-pass sigma-delta modulator is used. In the following, we
discuss the various low-pass sigma-delta modulators, starting with the simplest one:
a first-order modulator.

6.4.1     First  Order  Modulator 

A first-order low-pass (LP) sigma-delta modulator is shown in Figure 6.8.
This is a baseband ADC, which takes the continuous-time analog signal x(t) and
generates a discrete-time digital signal y(n), where n is the time index. This
modulator implements the loop filter using an integrator with gain k and the internal
quantizer using a 1-bit flash ADC. It takes the continuous-time input signal x(t),
samples it at a sampling frequency f_s, and generates a discrete-time (but
continuous-amplitude) signal x(n). The sampling function is implemented by the sample and
hold (SAH) described in the appendix of Chapter 5. This x(n) is then processed by the
loop and discretized in amplitude by the 1-bit ADC to generate the digital code y(n).
This y(n) consists of a stream of 1s and 0s, since the ADC is only a 1-bit ADC. The
stream of 1s and 0s is a pulse-density representation of the input signal x(n).

We will now go through an example of how this modulator works. First we
assume, for illustration purposes, that the integrator has a gain k = 1. We also assume
that x(t) has a constant DC value of 0.5 [which means x(n) = 0.5 for all n] and that
initially w(0) = 0. Finally, the threshold of the 1-bit ADC is set such that y(n) = 0 for
w(n) < 0.5 and y(n) = 1 for w(n) >= 0.5. Since w(0) = 0, y(0) = 0. When the
time index n goes from 0 to 1, we feed the value y(0) back and update u(1) as
follows: u(1) = x(1) - y(0) = 0.5. Hence w(1) = w(0) + k x u(1) =
0 + 0.5 = 0.5. Since w(1) = 0.5 reaches the threshold, y(1) becomes 1. Next, n
goes from 1 to 2. Then u(2) = x(2) - y(1) = -0.5, w(2) = w(1) + k x u(2) = 0, and
y(2) = 0. In summary, y(n) goes from 0 to 1 and back to 0 again. If we continue with
this iteration we will find that y(n) = 1, 0, 1, 0, etc., and so its average value is
exactly 0.5, the same as the input value of x(t). Therefore, an A/D conversion is
achieved, though in an average sense.
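
The iteration in this example can be reproduced with a few lines of code. The following is a minimal sketch, assuming the update rules used in the text (u(n) = x(n) - y(n-1), w(n) = w(n-1) + k u(n), and a threshold of 0.5 that is met when w(n) equals it); the function name and the choice of 1000 samples are illustrative only.

```python
def first_order_modulator(x, k=1.0, threshold=0.5):
    """Simulate the first-order loop of the worked example.

    x is a list of input samples x(0), x(1), ...; returns the bit stream y(n).
    """
    w = 0.0                                    # integrator state, w(0) = 0
    y = [1 if w >= threshold else 0]           # y(0) follows from the initial state
    for n in range(1, len(x)):
        u = x[n] - y[n - 1]                    # subtract the fed-back 1-bit DAC output
        w = w + k * u                          # integrator update
        y.append(1 if w >= threshold else 0)   # 1-bit quantizer
    return y


bits = first_order_modulator([0.5] * 1000)
print(bits[:6])                 # first few output bits
print(sum(bits) / len(bits))    # average of the bit stream
```

Running the same function with another DC input, for instance 0.25, gives a bit stream whose density of 1s again follows the input, which is the pulse-density behavior described above.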
Fig. 6.8 First order modulator


From a frequency-domain viewpoint, since the integrator's frequency response
has infinite gain at DC, the loop gain is infinite at DC and therefore the DC
component, or the average, of the output from the feedback 1-bit DAC will be
identical to the DC component of the input signal x(n). Reverting to the time-domain
viewpoint, this means that even though the quantization error, e(n) = y(n) - w(n), at
every sample is large because of the use of a 1-bit quantizer, the average of the
quantized signal, and therefore of the modulator output y(n), tracks the signal x(n).
This average is computed by a digital decimation filter that is not shown here.

In general, the quantization error decreases (or the resolution of the modulator
increases) when more samples are included in the averaging process, that is, as
the oversampling ratio (OSR), defined as the ratio of the sampling frequency f_s to two
times the signal bandwidth f_bw, increases. Consequently, the resolution of the
modulator is a function of the OSR. The principle of operation of sigma-delta
modulators relies on this fundamental trade-off between resolution and time. Since
in wireless communication we typically have a narrowband signal centered at
a high-frequency carrier (e.g., f_bw of AMPS is around 20 kHz with f_c centered
at around 900 MHz), sigma-delta modulation is a natural choice for digitizing
these narrowband signals.

On closer examination of the quantization process in a first-order sigma-delta
modulator, we can see that, in contrast with Nyquist rate A/D converters, the
quantization error is a differential error. In other words, the modulator tries to
cancel the error by subtracting the quantization errors of two adjacent samples.
This principle of reducing errors by exploiting the statistics between samples can be
extended to higher-order modulators, where more past error samples are
involved in the cancellation process to reduce the overall error. Viewed from the
frequency domain, this difference operation acts to attenuate the quantization noise
at low frequencies, thus shaping the noise.

Fig. 6.9 Model of a first order modulator

Next let us develop a model of this first-order modulator and calculate the
quantization noise. Quantization noise in this modulator depends on the input
signal. For a busy signal the quantization noise is like white noise, whereas with
a DC input the quantization noise is colored. To calculate the effective resolution
of the ΣΔ modulator, it is assumed that the input signal is sufficiently busy that
the quantization error of the 1-bit ADC in Figure 6.8 behaves like white noise that is
uncorrelated with the input signal. Hence the 1-bit ADC in Figure 6.8 can
be modeled as a gain block with a gain of 1 together with an additive white noise
source, as shown in Figure 6.9. The 1-bit DAC is then modeled by a gain block with
a gain of 1. Finally, the integrator is shown explicitly using its discrete-time
representation, again assuming that the integrator gain is 1. The total modulator
quantization noise e_T(n) [defined as y(n) - x(n-1)] can then be expressed as a
function of the 1-bit ADC quantization noise e(n) as


$$e_T(n) = e(n) - e(n-1) \tag{6.1}$$
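
As a sketch of where (6.1) comes from, assume the linear model of Figure 6.9 with unity gains and a delaying integrator whose output is w(n), so that the quantizer adds e(n) to w(n). Under these assumptions the loop equations are:

```latex
\begin{aligned}
w(n) &= w(n-1) + x(n-1) - y(n-1), \\
y(n) &= w(n) + e(n), \\
\Rightarrow\quad y(n) &= \bigl[y(n-1) - e(n-1)\bigr] + x(n-1) - y(n-1) + e(n) \\
                      &= x(n-1) + e(n) - e(n-1).
\end{aligned}
```

Subtracting x(n-1) from both sides leaves the first difference of the quantizer error, which is the total modulator noise in (6.1).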


We next take the z-transform of (6.1) and substitute z = e^{jωT}. Here ω is the
angular frequency in rad/s; T is the period of the sampling clock and hence equals 1/f_s.
The spectral density N(f) of e_T(n) can now be expressed in terms of the spectral
density E(f) of e(n) as:

$$N(f) = E(f)\,\left|1 - e^{-j\omega T}\right| = 2\,e_{rms}\sqrt{2T}\,\sin\!\left(\frac{\omega T}{2}\right) \tag{6.2}$$

Here E(f), being the spectral density of e(n), is flat, with an rms value
equal to e_rms; spread over the band from 0 to f_s/2, this gives E(f) = e_rms √(2T).

From (6.2) it is seen that feedback around the quantizer reduces the noise at low
frequencies but increases it at high frequencies. The total noise power in the signal
band, defined as n_bw^2, is

$$n_{bw}^2 = \int_0^{f_{bw}} |N(f)|^2\,df \approx e_{rms}^2\,\frac{\pi^2}{3}\,(2 f_{bw} T)^3, \qquad f_s \gg f_{bw} \tag{6.3}$$

Fig. 6.10 Second order modulator

Since f_s is the sampling frequency and equals 1/T, 2 f_bw T is the inverse of the
OSR. Substituting this in (6.3), the square root of n_bw^2, denoted as n_rms, is given
approximately by e_rms (π/√3) (OSR)^(-3/2). Each doubling of the OSR of this circuit
reduces n_rms by 9 dB and provides 1.5 bits of extra resolution. The improvement
in the resolution requires that the modulated signal be decimated to the Nyquist rate
with a sharply selective digital decimation filter, as mentioned previously. Otherwise,
the high-frequency components of the noise will spoil the resolution when it is
sampled at the Nyquist rate.
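
The 9 dB per octave figure can be checked numerically. The sketch below integrates the squared noise density of (6.2) over the signal band and compares the result with the closed-form approximation; it assumes a normalized sampling period T = 1 and e_rms = 1, and the function names are illustrative.

```python
import math


def inband_noise_rms_first_order(osr, e_rms=1.0, steps=20000):
    """Numerically integrate |N(f)|^2 from (6.2) over 0..f_bw (T normalized to 1)."""
    T = 1.0
    f_bw = 1.0 / (2.0 * osr * T)          # OSR = f_s / (2 f_bw), with f_s = 1/T
    df = f_bw / steps
    power = 0.0
    for i in range(steps):
        f = (i + 0.5) * df                # midpoint rule
        n_f = 2.0 * e_rms * math.sqrt(2.0 * T) * math.sin(math.pi * f * T)
        power += n_f * n_f * df
    return math.sqrt(power)


def inband_noise_rms_approx(osr, e_rms=1.0):
    """Closed-form approximation e_rms * (pi / sqrt(3)) * OSR**(-3/2)."""
    return e_rms * math.pi / math.sqrt(3.0) * osr ** -1.5


for osr in (16, 32, 64):
    exact_db = 20 * math.log10(inband_noise_rms_first_order(osr))
    approx_db = 20 * math.log10(inband_noise_rms_approx(osr))
    print(osr, round(exact_db, 2), round(approx_db, 2))
```

The two columns agree closely once the OSR is large, and successive rows differ by roughly 9 dB, in line with the 1.5 bits per doubling stated above.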


6.4.2    High  Order  Modulators 


The procedure for increasing the resolution with feedback can be reiterated by
replacing the 1-bit ADC and DAC in a first-order modulator, as shown in Figure 6.8,
by an identical first-order modulator. The resulting circuit is shown in Figure 6.10.
Notice that we have two integrators with two separate integrator gains: k_1 and k_2.
The gains k_1 and k_2 are used to optimize the output signal range of the integrators.
Modulators with a second-order transfer function involve the cancellation of the
two past samples and thus exhibit stronger attenuation at low frequencies. The N(f)
of a first- and second-order modulator are compared in Figure 6.12 [13].

Second-order modulators are shown to be conditionally stable. The stability
depends on the total delay in the feedback loop, the signal amplitude, and the
coefficients k_1 and k_2. To avoid saturation, the maximum signal levels at the
amplifier outputs should be adjusted by scaling.

The preceding analysis can be formalized to yield quantitative results for the
resolution of second-order sigma-delta modulators, provided that the
quantization error e(n) can be assumed to be uncorrelated.
The modulator can then be regarded as a linear system for which the spectral density
of the noise can be calculated [13]. Let us now calculate the N(f) and the
corresponding noise transfer function (NTF) of this second-order modulator. Here
the NTF is defined as

$$\mathrm{NTF}(z) = \frac{N(z)}{X(z)}$$

where N(z) is the z-transform of the modulator noise e_T(n)
and X(z) is the z-transform of the input x(n). To simplify discussions let us assume
that k_1 = k_2 = 1. First let us digress briefly on the implementation of
these two integrators. It turns out that there are two types of implementation.

Fig. 6.11 (a) Integrator with forward delay, (b) Integrator with feedback delay, (c) One type of
implementation of a second-order modulator

A block diagram representation of the first type of integrator with a gain of 1 is
shown in Figure 6.11(a). Notice that this integrator has a delay element z^-1 in the
forward path. This is also the integrator that we use in Figure 6.9. Now let us turn
our attention to Figure 6.11(b), where the delay has been moved to the feedback
path. If we work out the equation, Figure 6.11(b) also describes an integrator.
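
To make "working out the equation" concrete, the difference equations of the two block diagrams can be written down directly. This is a sketch using illustrative names u(n) for the integrator input and v(n) for its output, which are not taken from the figures:

```latex
\begin{aligned}
\text{forward delay, Figure 6.11(a):}\quad v(n) &= v(n-1) + u(n-1)
  &&\Rightarrow\quad \frac{V(z)}{U(z)} = \frac{z^{-1}}{1 - z^{-1}}, \\
\text{feedback delay, Figure 6.11(b):}\quad v(n) &= v(n-1) + u(n)
  &&\Rightarrow\quad \frac{V(z)}{U(z)} = \frac{1}{1 - z^{-1}}.
\end{aligned}
```

Both accumulate their input; they differ only in whether the current input sample reaches the output in the same clock cycle or one cycle later.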

Henceforth we label the integrator in Figure 6.11(a) type 1 and that in Figure
6.11(b) type 2. They have different transfer functions. The type 1 integrator has
a transfer function given by z^-1/(1 - z^-1), and the type 2 integrator's transfer
function is 1/(1 - z^-1). As it turns out, when we calculate N(f) and the corresponding
NTF for a second-order modulator, we can simplify the mathematics quite a bit if
integrators 1 and 2 in Figure 6.10 are implemented using type 2 and type 1 integrators,
respectively. The resulting block diagram is shown in Figure 6.11(c), where the
output of the modulator can be expressed as


$$y(n) = x(n-1) + \bigl(e(n) - 2e(n-1) + e(n-2)\bigr) \tag{6.4}$$

Fig. 6.12 N(f) for first and second order modulator

Hence the total modulator noise, e_T(n), is now the second difference of the
quantization error, e(n). Taking the z-transform and substituting z = e^{jωT} again,
the spectral density of e_T(n), N(f), is

$$N(f) = E(f)\left(1 - e^{-j\omega T}\right)^2 \tag{6.5}$$

$$|N(f)| = 4\,e_{rms}\sqrt{2T}\,\sin^2\!\left(\frac{\omega T}{2}\right) \tag{6.6}$$

and the rms noise in the signal band is given by

$$n_{rms} \approx e_{rms}\,\frac{\pi^2}{\sqrt{5}}\,(2 f_{bw} T)^{5/2} = e_{rms}\,\frac{\pi^2}{\sqrt{5}}\,(\mathrm{OSR})^{-5/2}, \qquad f_s \gg f_{bw} \tag{6.7}$$

This noise falls by 15 dB for every doubling of the sampling frequency,
providing 2.5 extra bits of resolution [13]. Figure 6.12 shows the spectral density of
the quantization noise for the first- and second-order modulators. As shown, the
quantization noise of the second-order modulator is smaller inside f_bw.
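
The second-difference relation (6.4) can be checked with a short simulation. The sketch below assumes one arrangement consistent with the text: a delay-free (type 2) first integrator, a delaying (type 1) second integrator, unity gains, output levels of +1 and -1, and the output fed back to the inputs of both integrators. Exact rational arithmetic is used so that the comparison is not affected by rounding; all names are illustrative.

```python
from fractions import Fraction


def second_order_modulator(x):
    """Second-order loop: type 2 first integrator, type 1 second integrator.

    Returns the outputs y(n) and the quantizer errors e(n) = y(n) - w(n).
    """
    v_prev = Fraction(0)   # first integrator output v(n-1)
    w_prev = Fraction(0)   # second integrator output w(n-1)
    y_prev = Fraction(0)   # previous modulator output y(n-1)
    ys, es = [], []
    for xn in x:
        w = w_prev + (v_prev - y_prev)                # type 1: uses last cycle's input
        y = Fraction(1) if w >= 0 else Fraction(-1)   # 1-bit quantizer
        v = v_prev + (xn - y)                         # type 2: uses this cycle's input
        ys.append(y)
        es.append(y - w)
        v_prev, w_prev, y_prev = v, w, y
    return ys, es


x = [Fraction(1, 4)] * 200
y, e = second_order_modulator(x)
ok = all(y[n] == x[n - 1] + e[n] - 2 * e[n - 1] + e[n - 2] for n in range(2, len(x)))
print(ok)
```

Because e(n) is defined here as the actual quantizer error, the identity holds sample by sample regardless of the input; the white-noise assumption is only needed when turning it into the spectral densities (6.5) to (6.7).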

We mention in subsection 6.3.1 a general definition of dynamic range
(or resolution). Let us give a more rigorous definition here. The dynamic range
(DR) (or resolution) of an ADC is defined as the ratio of the input signal power for a
full-scale sinusoidal input to the input signal power when the corresponding SNR is
one (0 dB). Assuming that the SNR versus input amplitude curve is linear with a
slope of 1, then DR also equals approximately SNR_max, which equals the ratio of the
rms value of the largest sine wave to the rms value of the total modulator noise.
Assuming that the 1-bit ADC has a step size of Δ, the full-scale sine wave that can
be applied before overloading occurs has a peak value of Δ/2 and its rms value is
Δ/(2√2). Furthermore, for a quantizer of step size Δ, e_rms = Δ/√12. Substituting this
and n_rms from (6.7) into the DR definition, we have

$$DR = \frac{\Delta/(2\sqrt{2})}{n_{rms}} = \frac{\Delta}{2\sqrt{2}}\,\frac{1}{e_{rms}}\,\frac{\sqrt{5}}{\pi^2}\,(\mathrm{OSR})^{5/2} = \frac{\sqrt{7.5}}{\pi^2}\,(\mathrm{OSR})^{5/2} \tag{6.8}$$

Fig. 6.13 Third-order modulator


Higher-order modulators, realized by adding more feedback loops to the circuit
[13], exist. In general, when a modulator has L loops and is not overloaded, it can be
shown that the spectral density of the modulator noise is

$$|N(f)| = e_{rms}\sqrt{2T}\,\left(2\sin\frac{\omega T}{2}\right)^{L} \tag{6.9}$$


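For oversampling ratios greater than 2, the rms noise in the signal band is given
approximately by

$$n_{rms} = e_{rms}\,\frac{\pi^{L}}{\sqrt{2L+1}}\,(2 f_{bw} T)^{L+\frac{1}{2}} = e_{rms}\,\frac{\pi^{L}}{\sqrt{2L+1}}\,(\mathrm{OSR})^{-\left(L+\frac{1}{2}\right)} \tag{6.10}$$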


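This noise falls by 3 x (2L+1) dB for every doubling of the sampling frequency,
providing (L + 1/2) extra bits of resolution. The extra dynamic range extended to the
1-bit ADC can now be written as [10]

$$DR = SNR_{max} = \frac{\Delta/(2\sqrt{2})}{n_{rms}} = \frac{\Delta}{2\sqrt{2}}\,\frac{1}{e_{rms}}\,\frac{\sqrt{2L+1}}{\pi^{L}}\,(\mathrm{OSR})^{L+\frac{1}{2}} = \sqrt{\frac{3(2L+1)}{2}}\,\frac{1}{\pi^{L}}\,(\mathrm{OSR})^{L+\frac{1}{2}} \tag{6.11}$$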
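
To get a feel for the numbers, the DR expression can be evaluated for a few orders and oversampling ratios. The sketch below assumes the reading of (6.11) as sqrt(3(2L+1)/2) / π^L x OSR^(L+1/2), expressed in dB as an amplitude ratio; the function name is illustrative, and the result ignores the stability and overload limits discussed next.

```python
import math


def dr_quantization_db(order, osr):
    """Quantization-limited DR in dB for an L-th order loop, read from (6.11)."""
    L = order
    dr = math.sqrt(1.5 * (2 * L + 1)) / math.pi ** L * osr ** (L + 0.5)
    return 20.0 * math.log10(dr)


for L in (1, 2, 3):
    per_octave = dr_quantization_db(L, 128) - dr_quantization_db(L, 64)
    print(L, round(dr_quantization_db(L, 64), 1), round(per_octave, 2))
```

The last column grows by about 6 dB per additional loop, which is the 3 x (2L+1) dB per doubling of the OSR quoted above.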


For wireless communication, where we want to minimize power, reducing the
oversampling frequency is desirable, as this would reduce the clock frequency and
hence power. However, achieving this through the use of higher-order modulators
encounters some difficulties, particularly as we move beyond two integrators.

First let us look at one such implementation of a third-order modulator, as shown
in Figure 6.13. Here again, the integrators all have gains of 1. Also, except for the
integrator right in front of the quantizer, all of the integrators are type 2 integrators.
If we model the 1-bit ADC and 1-bit DAC as before, then this third-order modulator
has a DR as given by (6.11), with L = 3.

The major difficulty lies in the fact that, due to feedback, the signal at the input of the
internal quantizer, w(n), may accumulate. This eventually overloads the modulator,
and when this happens w(n) and the other integrator output nodes exhibit limit
cycle oscillations with large amplitude, making the modulator unstable and
degrading the DR. Notice that this never happens in a Nyquist rate ADC since it
does not have feedback.

To help stabilize these circuits, integrator outputs need to be clipped, resulting
in DR performance much worse than predicted by (6.11). Better performance is
obtained by redesigning the filter used in the feedback loop. An example is shown
in the appendix, where this high-order sigma-delta modulator has been redesigned
to achieve very low voltage (hence low power) operation and proves to be
extremely useful as a baseband ADC.

Other stable high-order modulator architectures include

•  Interpolative  architecture  [2] 

•  MASH  architecture  [3] 

Now  that  we  have  discussed  the  basic  theory  of  sigma-delta  modulators,  we 
present  the  implementation  issues. 


6.5     Implementation  of  Low-Pass  Sigma-Delta  Modulators 

In our discussions of modulators, the subblocks (integrators, 1-bit ADC) are
assumed to be ideal. Real-life implementation of modulators falls into two broad
categories, depending on how the loop filter is implemented: continuous-time based
and switched-capacitor based. To remain focused, we assume that the sigma-delta
modulators are implemented using a switched-capacitor approach. The design of
switched-capacitor-based circuits can be quite involved, and complete books have
been dedicated to them. For example, [11] is a good reference. Here we just review
the basic switched-capacitor integrators in enough depth that we can carry out the
design of sigma-delta modulators.


6.5.1     Review of Switched-Capacitor-Based Integrators

An example of a switched-capacitor-based integrator is shown in Figure 6.14 and
consists of an op-amp, two capacitors, and switches. The sampling clock f_s consists
of two non-overlapping phases, φ1 and φ2. Corresponding to these two phases, the
switches configure the integrator in two configurations. In phase φ1, the integrator
is configured as shown. Hence the input voltage V_i is sampled on capacitor C_1. In phase
φ2, the left-hand side of C_1 is switched to ground, and the right-hand side is
switched to the summing node (negative terminal of the op-amp). Also, the output
of the op-amp is connected to V_o. Now since this summing node is a virtual ground,
the voltage across C_1 must be zero, which means its charge must be zero.
From charge conservation, the charge deposited on C_1 during φ1 has to go somewhere.
The only place it can go to is C_2. Hence the additional charge deposited on
C_2 is V_i x C_1. Therefore the change in voltage across capacitor C_2, denoted as ΔV_2,
is given by (C_1/C_2) x V_i. Therefore, V_2 (at the end of the clock cycle) = V_2 (at the
beginning of the clock cycle) + (C_1/C_2) x V_i. But V_2 is just the same as the output
voltage of the integrator V_o, and accordingly V_o (at the end of the clock cycle) = V_o
(at the beginning of the clock cycle) + (C_1/C_2) x V_i. This is the formula that
describes an integrator with gain equal to C_1/C_2, and so the switched-capacitor circuit
in Figure 6.14 does realize the integrating function. As a matter of fact the circuit in
Figure 6.14 realizes a type 1 integrator, as described in Figure 6.11(a), although
here the gain is C_1/C_2. From a switched-capacitor implementation point of view, a
type 1 integrator is more desirable. Hence, from now on we assume that all
integrators are implemented as type 1, with the proper gain, of course.

Fig. 6.14 Single-ended switched capacitor integrator


6.5.2     Type 1 Switched-Capacitor-Based Integrator


Using only type 1 integrators has a subtle but important effect on the NTF and
stability for modulators of order 2 or above. As a reminder, for
second-, third-, and higher-order modulators, so far we have derived the NTF and
hence the DR formula using an architecture that uses a type 1 integrator only in the
innermost loop and type 2 integrators in the outer loops. For example, (6.4) through
(6.11) have been derived with this in mind.

Since in subsection 6.5.1 we state that type 1 integrators should be used
exclusively, we want to ask the following: what impact does this have on the
sigma-delta modulator? Let us start by examining its effect on noise by
recalculating the NTF and DR. First we note that the delay in a type 1 integrator's
forward path introduces extra delay in the forward path of the sigma-delta
modulator. These delays introduce poles in the NTF, and we no longer
have the simple formula for DR derived in (6.11). However, these poles typically
have frequencies that are so far away from f_bw that their impact is quite modest.
Consequently, (6.11) still serves as a good first-order approximation for DR.
Second, there is the effect on stability. Because this delay introduces poles in
the signal transfer function (STF), stability is disturbed and has to be reexamined.
Here the STF is defined as Y(z)/X(z), where Y(z) is the z-transform of the output y(n)
and X(z) is the z-transform of the input x(n). Both of these effects also carry over to
modulators of higher order, L.


6.5.3    Non-Ideal  Integrator 

Up to now we have assumed that the op-amp and switches in Figure 6.14 are ideal.
For the op-amp this means it has infinite gain, infinite speed, and zero noise, and for
the switch this means it has zero resistance, zero noise, and infinite speed. How true
are these assumptions? A simple op-amp consists of a differential pair with some
resistive load, and a simple switch consists of a MOS transistor. Neither is
ideal: the operational amplifier has finite DC gain and finite bandwidth
(and hence nonzero settling time), and the switch has similar nonidealities. In the
following subsections, the impact of these nonidealities on the integrator is discussed.
For a more complete treatment, see [11].


6.5.3.1     Finite  Op-Amp  Gain 

Due to finite op-amp gain, the integrator's frequency response exhibits a finite DC
value because the pole is no longer at DC. This movement of the integrator's pole
results in the movement of the zeroes of the NTF, so that N(f) changes from that
shown in Figure 6.12 to that shown in Figure 6.15. We can see that N(f) no
longer goes to zero at DC but remains finite. This introduces excess quantization
noise within f_bw and compromises the overall SNR, and hence the DR,
relative to that predicted by (6.11). In general, the greater the leakage (the smaller
the frequency response of the integrator at DC, or the smaller the frequency response
of the chain of integrators at DC), the larger N(f) at DC is. As a good rule of thumb,
a first-order modulator needs an integrator whose frequency response at DC is greater
than the OSR. For a second-order modulator, since the forward path consists of a
cascade of two integrators, the individual integrator can have a frequency response at
DC that is smaller than the OSR. This lower limit on the frequency response of the
integrators at DC puts a lower limit on the gain of the op-amp that is used to
realize these integrators. To see how they are related, first define H(z) to be the
transfer function of the single-ended switched-capacitor integrator shown in
Figure 6.14. This transfer function, modified with finite operational amplifier
gain, can be expressed as [11]

$$H(z) = \frac{(C_1/C_2)\,z^{-1/2}\left(1 - \dfrac{1}{A} - \dfrac{C_1}{A C_2}\right)}{1 - \left(1 - \dfrac{C_1}{A C_2}\right) z^{-1}} \tag{6.12}$$

Fig. 6.15 N(f) for first and second order modulators with leaky integrators


Here A is the gain of the op-amp. As an example, using our rule of thumb for a
first-order modulator, the frequency response of the integrator at DC, H(z) at z = 1,
has to be larger than the OSR. We can then substitute this in (6.12) (with z = 1) and
find the required A. This usually means A > OSR as well.

For second- and higher-order modulators, the required H(z) at z = 1 can be relaxed.
The exact relaxed requirement can be worked out. However, instead of doing that,
we simplify matters by stating as a rule of
thumb that if A, the op-amp gain for all the op-amps used in the individual
integrators, is larger than the OSR, the leakage is tolerable:

$$A > \mathrm{OSR} \tag{6.13}$$
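
The link between the integrator's DC response and the op-amp gain can be illustrated numerically. The sketch below assumes the reading of (6.12) as H(z) = (C1/C2) z^(-1/2) (1 - 1/A - C1/(A C2)) / (1 - (1 - C1/(A C2)) z^-1) and evaluates it at z = 1; the function name and the example values of A and C1/C2 are illustrative.

```python
def integrator_dc_gain(A, c_ratio):
    """Evaluate the leaky-integrator transfer function at z = 1.

    A is the op-amp DC gain and c_ratio is the capacitor ratio C1/C2.
    """
    numerator = c_ratio * (1.0 - 1.0 / A - c_ratio / A)
    denominator = 1.0 - (1.0 - c_ratio / A)   # pole moved slightly away from z = 1
    return numerator / denominator


for A in (100.0, 1000.0):
    print(A, integrator_dc_gain(A, 0.5))      # close to A - 1 - C1/C2
```

Under this reading the DC response simplifies to A - 1 - C1/C2, so requiring it to exceed the OSR is nearly the same as requiring A > OSR.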


6.5.3.2     Finite Op-Amp and Switch Speed

Depending on the op-amp's settling behavior, different op-amp models can be
obtained, which are then used to predict the op-amp's impact on the integrators'
and hence the modulator's performance. If the finite speed of the op-amp used in the
integrator is modeled by representing the op-amp using a transfer function with a
single pole, then incomplete settling behaves in a linear fashion. This means H(z) is
changed so that the integrator appears leaky again, as discussed in subsection
6.5.3.1. Consequently, this has a similar impact on the quantization noise. As an
example, reconsider the integrator shown in Figure 6.14. Let us assume that the
op-amp's transfer function has a pole given by p_1. Now the integrator's transient
response during period φ2 is given by

$$V_o = V_a\left(1 - e^{-(T/2)/\tau}\right) \tag{6.14}$$

Here we assume that the duration of φ2 is half of T, the period of the sampling clock
(in practice φ2 has to be less than T/2 to make the two phases non-overlapping).
Moreover, τ is the op-amp's settling time constant and is given by 1/p_1, and V_a is the
voltage applied to the input of the op-amp at the beginning of φ2, when the right-hand
side of C_1 is switched from ground to the negative terminal of the op-amp. For a
constant sampling period, the second term inside the parentheses represents a constant
reduction in the gain of the integrator, hence making the integrator look leaky. It
turns out that this leakage is not too severe. Even if T/2 is made as short as 4τ,
the equivalent gain of the integrator is reduced by less than 2%. Again, if the settling
is linear, a settling error of 0.1% can typically be tolerated.

On the other hand, if the settling is dominated by slewing in the op-amp (this usually
occurs in the early part of the response when a step input is applied to the op-amp,
during which V_o is trying to ramp towards the applied V_a; the op-amp behaves in a
nonlinear fashion during this stage), then the preceding description is not true. If the
error is referred back to the input, and if the error made each time the op-amp settles
always accumulates, then it can be shown that to guarantee an n-bit DR for the
modulator, the settling error of the individual integrator, and hence of the accompanying
op-amp, should also be at the n-bit level. This is because the op-amp error is referred
back to the input of the integrator and hence of the modulator. Thus any error it makes is
identical to injecting the same error at the input of the modulator. For example, to
guarantee a 100 dB DR, the op-amp needs to settle to within 0.001% of its final value in
T/2. Accordingly, slew rate limiting should be avoided and simulations should be run
to ensure that the settling is essentially linear.
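
The two numerical claims above (less than 2% error after 4τ, and 0.001% for 100 dB) follow from the single-pole exponential in (6.14). A minimal sketch, assuming purely linear single-pole settling and treating DR as an amplitude ratio in dB; the function names are illustrative.

```python
import math


def linear_settling_error(n_tau):
    """Relative error left after settling for n_tau time constants (single pole)."""
    return math.exp(-n_tau)


def time_constants_for_dr(dr_db):
    """Time constants needed for the settling error to reach the given DR level."""
    return math.log(10.0 ** (dr_db / 20.0))


print(linear_settling_error(4.0))       # fractional error when T/2 = 4 tau
print(time_constants_for_dr(100.0))     # T/2 in units of tau for a 100 dB target
```

The second figure shows why the accumulated-error case is so much more demanding: settling to the 100 dB level takes roughly three times as many time constants as the 4τ that suffices when the error only appears as a small gain reduction.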

Finally,  we  have  to  examine  settling  problems  due  to  the  RC  time  constant  of  the 
MOS  switch.  This  can  usually  be  solved  simply  by  increasing  the  width  of 
the  switch.  Of  course,  there  is  an  ultimate  limit  in  that  as  the  switch  becomes  larger 
in  comparison  to  the  switched  capacitor,  the  effects  due  to  channel  charge  injection 
and  the  parasitic  junction  capacitances  also  increase,  as  we  have  discussed  in  the 
appendix  to  Chapter  5  (where  we  discuss  SAH). 

In summary, for the integrator shown in Figure 6.14, the modified integrator
transfer function due to finite op-amp bandwidth and assuming a linear
settling behavior is [11]

$$H(z) = \frac{(C_1/C_2)\,z^{-1/2}\left[(1-\varepsilon) + z^{-1}\,\varepsilon\,\dfrac{C_2}{C_1 + C_2}\right]}{1 - z^{-1}} \tag{6.15}$$

where

$$\varepsilon = e^{-\pi f_u T} \tag{6.16}$$

Here T is the sampling period and f_u is the unity-gain bandwidth of the op-amp (in
hertz). Notice that in (6.16) we assume that the feedback factor is 1, although in real
life it is somewhat modified by the ratio of C_1 to C_2. We will have a chance to revisit
this assumption when we talk about the switched-capacitor-based resonator in subsection
6.7.2.2.  For  a  one-pole  response,/,  —  Ax  2npi 

Similarly, the modified transfer function due to nonzero switch resistance is [11]

$$H(z) = \frac{(C_1/C_2)\,z^{-1/2}\left(1 - 2e^{-T/(4 R_{on} C_1)}\right)}{1 - z^{-1}} \tag{6.17}$$

Here R_on is the on-resistance of the switch.


6.5.3.3     Integrator  Noise 

Achievable DR in a sigma-delta modulator is constrained by the available signal
swing at one extreme and noise sources at the other. So far we have concentrated on
quantization noise and used it to derive the DR, which was given in (6.8). Noise can
also arise from the power supply or substrate coupling, from clock signal feedthrough,
and from thermal and 1/f noise generation in the MOS devices.

1/f noise can be a problem in the circuit but is usually taken care of by using large
input devices. Thermal noise is a more fundamental problem. Because of thermal
noise in the switch resistance, each voltage sampled onto a capacitance C exhibits
an uncertainty of variance kT/C, where k is the Boltzmann constant and T is the
absolute temperature. For an integrator circuit like that in Figure 6.14, the input
capacitor $C_1$ sees two switch paths per clock cycle, so the noise variance increases
by a factor of 2. Hence the noise sampled on it has a variance equal to $2kT/C_1$.
This noise is then filtered by the integrator, which has a gain of $C_1/C_2$ and a noise
bandwidth ($f_b$) that is approximately half of its unity gain frequency ($f_u$). Hence the
final output noise variance is
$2kT/C_1 \times C_1/C_2 \times f_b/f_u = 2kT/C_1 \times C_1/C_2 \times 1/2 = kT/C_2$.
(Actually the noise sources in the switches and op-amp interact due to bandwidth
effects, but it has been established that a lower bound for the combined op-amp and
switch thermal sources in this configuration is $kT/C_2$.) Due to oversampling, this
noise variance is effectively reduced by the OSR to $kT/(OSR \times C_2)$, so the standard
deviation is $\sqrt{kT/(OSR \times C_2)}$. Next we want to determine the maximum RMS value of a
sinusoidal signal within a full-scale voltage $V_{FS}$, and this turns out to be $V_{FS}/\sqrt{2}$.
Taking the ratio of the two gives the maximum signal-to-thermal-noise ratio for a
sigma-delta modulator, which is also its dynamic range DR:


$$DR = SNR_{max} = V_{FS}\sqrt{\frac{OSR \times C_2}{2kT}} \qquad (6.18)$$

Notice  that  (6.9)  gives  the  DR  limited  by  quantization  noise  and  (6.18)  gives  the 
DR  limited  by  thermal  noise. 
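
The thermal-noise limit can be evaluated numerically. The following is a minimal sketch, not the author's code: it takes the signal RMS as $V_{FS}/\sqrt{2}$ and the noise standard deviation as $\sqrt{kT/(OSR \times C_2)}$, exactly as in the derivation above. The function name `thermal_dr_db` and the default temperature of 300 K are assumptions made for illustration.

```python
import math

BOLTZMANN = 1.380649e-23  # Boltzmann constant in J/K


def thermal_dr_db(v_fs, osr, c2, temp_k=300.0):
    """Thermal-noise-limited DR in dB.

    Signal RMS is V_FS/sqrt(2); noise RMS is sqrt(kT/(OSR*C2)).
    temp_k = 300 K is an assumed operating temperature.
    """
    signal_rms = v_fs / math.sqrt(2.0)
    noise_rms = math.sqrt(BOLTZMANN * temp_k / (osr * c2))
    return 20.0 * math.log10(signal_rms / noise_rms)


if __name__ == "__main__":
    # Sweep the integrating capacitor for a 1 V full scale and an OSR of 50.
    for c2 in (0.1e-12, 0.5e-12, 2.0e-12):
        dr = thermal_dr_db(1.0, 50, c2)
        print(f"C2 = {c2 * 1e12:.1f} pF -> DR = {dr:.1f} dB")
```

Because the noise variance scales as $1/C_2$, each quadrupling of $C_2$ in this model buys about 6 dB of DR, which is why capacitor size is set by the DR target rather than chosen freely.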

Fig. 6.16    Single-ended switched capacitor integrator with 1-bit DAC


6.5.4    1-bit ADC

A 1-bit ADC is just a single comparator. Ideally such a comparator should have
zero delay and zero offset. A simple implementation of such a comparator consists of a
differential pair with load (just like the op-amp), which again suffers from having
finite speed and finite offset. Because of the feedback action provided by the
sigma-delta modulator, this comparator offset will translate into an equivalent offset at
the input of the modulator, without increasing the quantization noise of the
modulator. As for the speed, the comparator has to switch and make a decision in
one sampling clock period. Compared with the op-amp used to realize the integrator,
the comparator output does not have to settle to as high an accuracy. Moreover, the
comparator operates in an open-loop configuration, which is faster than the
closed-loop configuration that the op-amp has to work in. Hence the speed requirement is
usually met easily. The thermal noise is not important since it is noise shaped.


6.5.5    1-bit DAC


Figure 6.14 is redrawn in Figure 6.16 to show how this 1-bit D/A conversion is
done. The latched comparator decision (the digital output of the 1-bit ADC) y(n) is
used to apply selectively a $+V_{ref}$ or $-V_{ref}$ voltage, through a separate capacitor $C_{ref}$,
to the integrators to perform a 1-bit D/A conversion. In Figure 6.16, if y(n) is 1, then
$V_{ref}$ will be applied to $C_{ref}$ during $\phi_1$. During $\phi_2$, this charge will be transferred to
$C_2$ and changes the output by $V_{ref} \times (C_{ref}/C_2)$. On the other hand, if y(n) is 0, then
$-V_{ref}$ will be applied, and at the end of $\phi_2$ the output changes by $-V_{ref} \times (C_{ref}/C_2)$.
Consequently, a 1-bit D/A conversion plus integration is achieved. Because of the
1-bit nature, the D/A conversion is inherently linear. Of course, any noise on
$V_{ref}$ and $-V_{ref}$ will manifest itself as input noise and must be reduced.
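
To see how the 1-bit DAC closes the loop, a purely behavioral first-order modulator can be simulated. This is a hedged sketch rather than a model of Figure 6.16: the single gain `k` stands in for both $C_1/C_2$ and $C_{ref}/C_2$ (assumed equal), the feedback is assumed to be subtracted from the input, and the function name `first_order_modulator` is hypothetical.

```python
def first_order_modulator(x, n_samples, v_ref=1.0, k=0.5):
    """Return the 1-bit DAC output sequence for a constant input x.

    Assumptions: one integrator, feedback subtracted from the input,
    and a single gain k applied to both the input and the DAC path.
    """
    u = 0.0          # integrator output
    dac_out = []
    for _ in range(n_samples):
        y = 1 if u >= 0.0 else 0            # 1-bit ADC (comparator decision)
        v = v_ref if y == 1 else -v_ref     # 1-bit DAC: +Vref for 1, -Vref for 0
        u += k * (x - v)                    # integrate input minus DAC feedback
        dac_out.append(v)
    return dac_out


if __name__ == "__main__":
    out = first_order_modulator(0.3, 10000)
    print("mean DAC output:", sum(out) / len(out))
```

Since the DAC has only two levels, any two points define a straight line, which is the sense in which the conversion is inherently linear; the average of the two-level output tracks the input as long as the integrator state stays bounded.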

Now that we have introduced all the basic theory and implementation issues, we
go  through  the  design  procedure. 


6.5.6    Design  Procedure 

In  this  subsection  we  explain  the  steps  it  takes  to  design  a  low-pass  sigma-delta 
modulator.  The  design  procedure  consists  of  the  following  steps: 

1.  From Chapter 2, find the specifications for the ADC in terms of DR, full-scale
voltage ($V_{FS}$), bandwidth of digitization ($f_{bw}$), and the IF.

2.  Let us assume that we fix the order, L, of the modulator. We then assume that the
modulator is implemented using type 2 integrators except in the innermost loop.
Furthermore, the gains are all assumed to be 1. Then, for a given L and DR, we
will determine the OSR using (6.11). Usually DR is given in log scale (i.e., in
decibels). Therefore, it is more convenient to work with the logarithmic form of
(6.11). The dynamic range expression in logarithmic form is

$$DR|_{dB} = 20\log DR = 10\log\left[\frac{3}{2}\,\frac{2L+1}{\pi^{2L}}\,(OSR)^{2L+1}\right] \qquad (6.19)$$

For example, for a single-stage first-order modulator the DR
equation in decibels becomes

$$DR|_{dB} = -3.41 + 30\log(OSR) \qquad (6.20)$$

and for a second-order modulator the DR equation in decibels is

$$DR|_{dB} = -11.14 + 50\log(OSR) \qquad (6.21)$$

After  determining  the  OSR,  the  sampling  frequency  (fs)  can  be  calculated  for  a 
given  bandwidth.  If  the  calculated  sampling  frequency  is  too  large  for  a  given 
technology,  the  order  must  be  increased.  In  addition,  the  sampling  frequency 
should  be  kept  below  the  IF. 
3.  The  initial  design  from  step  2  is  now  reiterated  by  changing  the  implementation 
to  one  based  on  type  1  integrators  only.  With  the  new  types  of  integrators  we 
perform  signal  scaling  by  changing  the  gains  of  the  integrators  and  feedback 
factors  so  that  this  new  modulator  can  achieve  the  original  DR.  Stability  is  then 
checked.  If  the  modulator  is  unstable,  we  reduce  the  out-of-band  gain  of  the 
NTF,  again  by  changing  the  gains  of  the  integrators.  If  the  modulator  has  a  large 
safety  margin  for  stability,  we  can  check  and  see  if  we  can  improve  the  DR  by 
increasing  the  out-of-band  gain  of  the  NTF,  also  via  changing  the  gains  of  the 
integrators.  At  the  end  of  this  step,  the  gains  of  individual  integrators  and 
feedback factors from the 1-bit DAC to these integrators are finalized.

4.  Determine the minimum requirement for each block of the sigma-delta
modulator  to  meet  the  specification.  To  complete  the  design,  op-amp 
specifications  such  as  DC  gain,  and  unity  gain  bandwidth  have  to  be  found. 
The  important  parameters  can  be  summarized  as  follows: 

(a)  Minimum gain for op-amp. One simple rule is to set the gain to be a few
times the OSR (we arbitrarily set the factor to 5 here).

$$A = 5 \times OSR \qquad (6.22)$$

(b)  Unity gain bandwidth of op-amp as dictated by clock frequency and settling
error. The settling behavior, as explained in subsection 6.5.3.2, is dependent
on the sampling frequency. To quantify this we assume linear settling
behavior. First, the values for $C_1/C_2$ and $C_1/(C_1 + C_2)$ required in (6.15)
can be determined from the integrator gains (determined in step 3). Also, in
(6.15) $z = e^{j\omega T}$, where $\omega = 2\pi f_{bw}$ ($f_{bw}$ has been determined in step 1) and
$T = 1/f_s$ ($f_s$ has been determined in step 2). Next we can calculate $H(z)_{no\ error}$ by
setting $\varepsilon$ to 0 in (6.15). Let us assume that we can tolerate an error of 0.1%,
which is good enough for most applications. Then we find
$H(z)_{error=0.1\%} = 99.9\% \times H(z)_{no\ error}$. Substituting this in (6.15) again, we can
calculate the $\varepsilon$ corresponding to an error of 0.1%. Substituting this in (6.16), we
can find $f_t$. To simplify the procedure further, we observe that for most $C_1$,
$C_2$ values, the $f_t$ so obtained is on the order of 5 to 10 times $f_s$, the
sampling frequency. We can then use the following rule of thumb to find $f_t$:

$$f_t = 5 f_s \qquad (6.23)$$

(c)  Size of capacitors of individual integrators as dictated by thermal noise
consideration. The maximum dynamic range limited by thermal noise is
given by (6.18) and repeated here:

$$DR = V_{FS}\sqrt{\frac{OSR \times C_2}{2kT}} \qquad (6.24)$$

Here $V_{FS}$ is the full-scale input voltage of the modulator and must be less than Vdd,
the power supply voltage. Notice that the DR obtained in (6.24) must be larger than
the DR in step 1 with a good safety margin. Typically for a given Vdd, the $V_{FS}$ is limited.
Once DR and $V_{FS}$ are determined, the values can be substituted into (6.24) to
calculate the proper value of $C_2$. $C_1$ is then calculated from $C_2$ and the integrator
gain. $C_{ref}$ is calculated from $C_2$ and the feedback factor (determined in step 3).
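
The quantization-noise relation used in step 2 can be checked numerically. The sketch below is a hedged illustration: it evaluates the constant term of (6.19) for a given order L, which should reproduce the offsets quoted in (6.20) and (6.21), and inverts the expression to find the OSR needed for a target DR. The function names are hypothetical.

```python
import math


def dr_offset_db(order):
    """Constant term of (6.19): 10*log10((3/2) * (2L+1) / pi**(2L))."""
    return 10.0 * math.log10(1.5 * (2 * order + 1) / math.pi ** (2 * order))


def dr_db(order, osr):
    """Quantization-noise-limited DR in dB for an order-L modulator."""
    return dr_offset_db(order) + 10.0 * (2 * order + 1) * math.log10(osr)


def min_osr(order, dr_target_db):
    """OSR at which dr_db(order, OSR) equals the target DR."""
    slope = 10.0 * (2 * order + 1)
    return 10.0 ** ((dr_target_db - dr_offset_db(order)) / slope)


if __name__ == "__main__":
    for order in (1, 2, 3):
        print(f"L = {order}: offset = {dr_offset_db(order):.2f} dB, "
              f"OSR for 72 dB = {min_osr(order, 72.0):.1f}")
```

Sweeping the order this way shows the trade-off described in step 2: a higher order lowers the required OSR, and hence the sampling frequency, at the cost of a more delicate stability check in step 3.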
The  following  design  example  illustrates  this  procedure. 

Design Example 6.1. In this design example, a complete design of a low-pass
sigma-delta modulator is presented, following the design steps explained in
subsection 6.5.6.


Step  1:  We  start  from  the  example  ADC  given  in  Problem  2.9,  Chapter  2.  Instead  of 
digitizing  at  800  MHz,  we  assume  that  a  mixer  is  added  to  translate  the  signal  to  a 
20  MHz  IF.  Hence  the  ADC  is  digitizing  a  20  MHz  IF  with  a  200  kHz  bandwidth 
and  a  72  dB  (12  bit)  DR.  Instead  of  a  VFS  of  3.13  V,  we  change  to  a  VFS  of  1  V. 

Step 2: The order of the modulator, OSR, and the sampling frequency. First let
us assume that a second-order modulator is used, again with type 2 integrators,
except in the innermost loop. Let us repeat (6.21), the DR equation for a
second-order modulator:

$$DR|_{dB} = -11.14 + 50\log(OSR) \qquad (6.25)$$

Substituting a DR of 72 dB into the preceding equation, we get an OSR of 46.55.

With an OSR of 46.55 and a 200 kHz bandwidth, the sampling frequency should
be chosen to be at least 18.6 MHz. This is a reasonable sampling frequency for a
switched-capacitor-based implementation under the current technology and no
iteration on the order of the modulator is needed. Since the IF is 20 MHz, $f_{lo}$ = 20 MHz.
In general, to simplify the generation of $f_{lo}$ and $f_s$, we set the two to be related by an
integer ratio, with $f_s \le f_{lo}$. In the present case this can be achieved by setting the
sampling frequency to 20 MHz and hence the final OSR becomes 50.
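
The arithmetic of step 2 can be reproduced with a few lines. This is a sketch under stated assumptions: the OSR is taken as $f_s/(2 \times \text{bandwidth})$, which is the definition implied by the 18.6 MHz and OSR = 50 figures above, and the helper `integer_ratio_rate` is a hypothetical way of picking a sampling rate that divides the LO frequency. Using exactly 72 dB yields an OSR near 46.0; the quoted 46.55 is reproduced when the DR is taken as roughly 72.25 dB (12 bits at 6.02 dB per bit), which is an assumption about how that figure was obtained.

```python
def osr_for_second_order(dr_db):
    """Invert DR|dB = -11.14 + 50*log10(OSR) for the OSR."""
    return 10.0 ** ((dr_db + 11.14) / 50.0)


def sampling_rate(osr, bandwidth_hz):
    """Sampling rate implied by an OSR, taking OSR = f_s / (2 * bandwidth)."""
    return 2.0 * osr * bandwidth_hz


def integer_ratio_rate(f_lo_hz, f_s_min_hz):
    """Smallest f_lo / n (n a positive integer) that is still >= f_s_min.

    Returns None when even f_lo itself is below the minimum rate.
    """
    n = int(f_lo_hz // f_s_min_hz)
    return f_lo_hz / n if n >= 1 else None


if __name__ == "__main__":
    bandwidth = 200e3
    f_lo = 20e6
    f_s_min = sampling_rate(46.55, bandwidth)
    f_s = integer_ratio_rate(f_lo, f_s_min)
    print("OSR for 72 dB:", round(osr_for_second_order(72.0), 2))
    print("minimum f_s (MHz):", f_s_min / 1e6)
    print("chosen f_s (MHz):", f_s / 1e6, "final OSR:", f_s / (2 * bandwidth))
```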

Step  3:  Determine  the  NTF  transfer  function.  In  step  2  we  determined  that  a  second- 
order  modulator  is  sufficient  to  achieve  the  DR  specification.  Next  we  want  to 
derive  the  NTF  of  this  modulator  using  type  1  integrators  and  with  proper  scaling. 
First we take Figure 6.11(c) and change all integrators to type 1 integrators. Figure
6.11(c) now becomes Figure 6.17(a). Notice that these figures have the same NTF.
Next,  notice  that  in  this  new  configuration  integrator  2  has  a  gain  of  2  and  so  the 
signal  swing  at  the  output  of  integrator  2  is  large  compared  with  the  signal  swing  at 
the  output  of  integrator  1,  which  only  has  a  gain  of  0.5.  Hence  as  x(n)  increases  in 
amplitude,  the  output  of  integrator  2  reaches  maximum  (limited  by  Vdd)  sooner  and 
limits  the  dynamic  range  of  the  whole  modulator. 

In  order  to  maximize  the  dynamic  range  of  the  modulator,  we  want  to  make  the 
internal  swings  all  roughly  equal.  This  can  be  achieved  by  changing  the  gain 
preceding  the  integrators  (similar  to  scaling  in  typical  filter  design).  In  the  present 
example,  this  can  be  done  by  putting  a  gain  of  less  than  1  in  front  of  each  integrator. 
One  way  of  performing  this  is  shown  in  Figure  6.17(b).  Since  we  have  changed  the 
gain  of  integrators,  the  NTF  changes  and  both  DR  and  stability  must  be  checked. 
We rederive the NTF, assuming a white additive noise model:

$$NTF(z) = \frac{4(1 - z^{-1})^2}{3z^{-2} - 6z^{-1} + 4} \qquad (6.26)$$

Notice that this NTF has poles and differs from the NTF derived for the
modulator in Figure 6.11(c) [the NTF derived there is restated here:
$NTF = (1 - z^{-1})^2$, and has no poles]. From (6.26) we can see that there are two
poles, each with a radius of 0.866 and with angles of ±30°. Hence the poles are
inside the unit circle and the modulator is stable. Furthermore, $\omega_{bw}$ ($= 2\pi f_{bw}$) is so
far away from the poles ($\omega_{bw}/\omega_s$ is given approximately by the ratio 360°/OSR or
7.2°, which is much less than 30°) in the present case that the poles hardly affect the
NTF (hence DR) and so no iteration on the last step is needed.

Fig. 6.17  (a) Second-order modulator using only type 1 integrators. (b) Second-order modulator
using only type 1 integrators and with scaling
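
The stability check on (6.26) is easy to reproduce. The sketch below, offered as an illustration rather than the author's procedure, multiplies the denominator $4 - 6z^{-1} + 3z^{-2}$ by $z^2$ and solves $4z^2 - 6z + 3 = 0$ with the quadratic formula; evaluated this way, the roots come out at a radius of about 0.866 and angles of about ±30°. It also compares the denominator magnitude at the band edge with its DC value, taking a 200 kHz band edge and a 20 MHz sampling rate from the example as assumptions.

```python
import cmath
import math


def ntf_poles():
    """Roots of 4*z**2 - 6*z + 3 = 0, i.e. the poles of the NTF in (6.26)."""
    disc = cmath.sqrt(6.0 ** 2 - 4.0 * 4.0 * 3.0)
    return (6.0 + disc) / 8.0, (6.0 - disc) / 8.0


def denominator(z):
    """Denominator of (6.26) evaluated at a point z."""
    zi = 1.0 / z
    return 3.0 * zi ** 2 - 6.0 * zi + 4.0


if __name__ == "__main__":
    for p in ntf_poles():
        print(f"pole radius = {abs(p):.3f}, "
              f"angle = {math.degrees(cmath.phase(p)):.1f} deg")

    # Band edge as an angle on the unit circle: 2*pi*f_bw/f_s (assumed values).
    theta = 2.0 * math.pi * 200e3 / 20e6
    edge = abs(denominator(cmath.exp(1j * theta)))
    dc = abs(denominator(1.0))
    print(f"|denominator| at band edge / at DC = {edge / dc:.4f}")
```

A ratio close to 1 means the denominator is nearly constant over the signal band, so the in-band noise shaping is still set by the $(1 - z^{-1})^2$ numerator.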

Step  4:  Determine  the  minimum  requirement  for  each  block. 

(a)  Op-amp  gain:  The  minimum  required  op-amp  gain  can  be  calculated  by 
satisfying  (6.13).  To  give  some  safety  margin  we  use  (6.22)  or  A  =  250. 

(b)  Op-amp  unity  gain  bandwidth:  The  sampling  frequency  fs  is  chosen  to  be  20 
MHz.  Let  us  assume  a  linear  settling  behavior  and  a  tolerable  error  of  0.1%. 
This  is  in  line  with  the  conditions  that  lead  to  the  simplifications  in  step  4(b)  of 
the  design  procedure.  Hence  we  can  use  the  rule  of  thumb  in  (6.23)  and  set  the 
unity  gain  bandwidth  of  the  op-amp  ft  to  5  x  fs,  or  100  MHz.  An  op-amp  with 
this  unity  gain  frequency  can  be  readily  implemented  in  the  current  technology. 

(c)  Sampling capacitor $C_1$: To find the bound on $C_2$ we turn (6.24) into an
inequality, $DR \le V_{FS} \times \sqrt{OSR \times C_2/(2kT)}$. We can then substitute the values of DR,
OSR, and $V_{FS}$ obtained from previous steps, which equal 72 dB, 50, and 1 V,
respectively. Hence we have $72\ \mathrm{dB} \le 20\log\left(1\ \mathrm{V} \times \sqrt{50 \times C_2/(2kT)}\right)$ and we find that
a $C_2$ of 0.5 pF is more than enough to meet the requirement. Next, from step 3,
we have finalized the integrators' gains and feedback factors, which were
shown in Figure 6.17(b). From Figure 6.17(b) we see that the gain $k_1$ of the
first integrator is 1/2. Therefore, $C_1/C_2 = 1/2$ or $C_1 = C_2 \times 1/2 = 0.25$ pF. Since
the gain $k_2$ of the second integrator is also 1/2, we can simply copy the capacitor
values for integrator 1 to those of integrator 2. In general, the size of capacitors
of the second stage can be made smaller, since any thermal noise from
the switch associated with these capacitors is noise-shaped. Finally,
again from Figure 6.17(b), the feedback factor of the modulator is 1 and so
$C_{ref} = C_1 = 0.25$ pF for both integrators 1 and 2.
(d)  Comparator speed: The comparator has to be fast enough that the
output code y(n) is made available in less than half of the clock period, or 25
ns. As mentioned, the offset or the thermal noise of the comparator is not
very important.

Fig. 6.18    Block diagram of IF digitizer using merged mixer/sampler and a passive loop filter
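
The block-level numbers of step 4 follow from a handful of one-line formulas, collected in the sketch below. It is an illustration under assumptions: the temperature is taken as 300 K, the thermal-noise bound is used in the form $DR \le V_{FS}\sqrt{OSR \times C_2/(2kT)}$, and the function and variable names are hypothetical.

```python
import math

BOLTZMANN = 1.380649e-23  # Boltzmann constant in J/K


def min_c2(dr_db, v_fs, osr, temp_k=300.0):
    """Smallest C2 (farads) meeting DR <= V_FS*sqrt(OSR*C2/(2kT))."""
    dr = 10.0 ** (dr_db / 20.0)
    return 2.0 * BOLTZMANN * temp_k * dr ** 2 / (osr * v_fs ** 2)


if __name__ == "__main__":
    osr, f_s = 50, 20e6
    op_amp_gain = 5 * osr             # rule of thumb (6.22)
    f_t = 5 * f_s                     # rule of thumb (6.23)
    c2 = 0.5e-12                      # value chosen in the example
    c1 = 0.5 * c2                     # integrator gain of 1/2
    c_ref = c1                        # feedback factor of 1
    comparator_time = 0.5 / f_s       # half a clock period

    print("op-amp gain:", op_amp_gain)
    print("unity gain bandwidth (MHz):", f_t / 1e6)
    print("thermal-noise bound on C2 (fF):", min_c2(72.0, 1.0, osr) * 1e15)
    print("C1 = Cref (pF):", c1 * 1e12, c_ref * 1e12)
    print("comparator decision time (ns):", comparator_time * 1e9)
```

Under these assumptions the thermal-noise bound on $C_2$ lands in the femtofarad range, which is why 0.5 pF is described as more than enough; in practice matching and parasitics, not thermal noise, would set such a capacitor's size.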

Now  that  we  have  gone  through  a  design  example,  we  discuss  special  low-power 
architectures  for  wireless  communication. 


6.5.7    Passive  Low-Pass  Sigma-Delta  Modulator 


One particular architecture of interest to wireless communication is the passive
low-power sigma-delta modulator. This architecture differs from the conventional
sigma-delta modulator we have described so far in two aspects. First, the mixer in
front of the ADC (e.g., if we use this architecture as the baseband ADC described in
Figure 6.4, it will be the mixer in front of the baseband ADC in that figure) is merged
with the sampler inherent in the baseband ADC. This sampler
is shown explicitly in Figure 6.8 for the case when the baseband ADC is
implemented with a sigma-delta modulator. Since the mixer is implemented using
a sampler-type structure, it is passive and is identical to the sampling mixer
described in Chapter 5. Second, this architecture differs from a conventional
sigma-delta modulator in that the integrators are replaced with a passive loop filter.
In the present case the passive loop filter is implemented using R and C only, with
the R being realized as a switched capacitor. To compensate for the lack of gain in
this passive filter, a switch-only gain-boost network provides gain without using any
amplifier.

The block diagram of this architecture is shown in Figure 6.18. Since the mixer
and sampler are merged, the output of this mixer/sampler is already the sampled
value x(n). The passive loop filter is labeled H. The 1-bit ADC is still the same.
However, as will be shown, the equivalent input noise of the comparator used in
implementing this 1-bit ADC becomes very important in the present case and hence
the 1-bit ADC is modeled with an explicit representation of this noise. (Remember
that in subsection 6.5.2 we stated that this noise is not important.) Also notice that
there is a gain block G that replaces the unity gain in the 1-bit ADC used in the
conventional sigma-delta modulator.

Fig. 6.19    Circuit implementation of passive sigma-delta modulator

To show the importance of the equivalent input noise of the comparator, $e_{com}(n)$,
let us derive the transfer function of the sigma-delta modulator. This transfer
function is:

$$Y(z) = X(z) + \frac{E(z)}{G\,H(z)} + \frac{E_{com}(z)}{H(z)} \qquad (6.27)$$

Here H(z) is the transfer function of the passive loop filter, E(z) is the z-transform of
the quantization noise, $E_{com}(z)$ is the z-transform of the equivalent input noise of the
comparator, and G is the equivalent gain provided by the 1-bit ADC. Here G can be
on the order of thousands. If we keep on decreasing the E(z) term (by increasing G while
keeping H constant), eventually the third term in (6.27) will become larger than the
second term and $E_{com}(z)$ will dominate the overall noise contribution and limit the
overall DR. This is one of the essential differences between the passive and
conventional sigma-delta modulators.
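
The way the comparator noise takes over can be illustrated with the two noise terms of (6.27) evaluated at a single frequency. The sketch below assumes the quantization noise and the comparator noise are uncorrelated, so their contributions add in power; the numeric values in the example run are hypothetical and chosen only to show the trend.

```python
import math


def output_noise_rms(e_q, e_com, gain_g, h_mag):
    """RMS sum of the second and third terms of (6.27) at one frequency.

    e_q     : quantization noise (rms)
    e_com   : comparator equivalent input noise (rms)
    gain_g  : gain G of the 1-bit ADC
    h_mag   : |H| of the passive loop filter at the frequency of interest
    Assumes the two noise sources are uncorrelated.
    """
    quant_term = e_q / (gain_g * h_mag)
    comp_term = e_com / h_mag
    return math.sqrt(quant_term ** 2 + comp_term ** 2)


if __name__ == "__main__":
    e_q, e_com, h_mag = 0.3, 40e-6, 1.0   # hypothetical values
    for g in (10, 100, 1000, 10000, 100000):
        total = output_noise_rms(e_q, e_com, g, h_mag)
        print(f"G = {g:>6}: total noise = {total * 1e6:8.1f} uV rms")
```

With H held fixed, the total falls roughly as 1/G at first and then flattens at $e_{com}/|H|$, which is the floor the text refers to.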

Figure 6.19 shows the new architecture implemented with a second-order
passive loop filter (i.e., H is a second-order filter). This second-order filter is
implemented with two cascaded RC branches, with the R realized using a switched
capacitor. For example, $R_1$ is implemented with the $S_{m1}$, $C_{r1}$, and $S_{m3}$ combination.
This together with $C_1$ gives the first pole. The switches, $C_{r2}$ and $C_2$, form the second
pole. Finally, $C_{r0}$ introduces a zero. $\phi_1$ and $\phi_2$ are the normal two-phase clocks. $\phi_{1a}$
is the same as $\phi_1$ except that the falling edge is advanced slightly (the same as in
bottom-plate sampling as discussed in Chapter 5).

Next we talk about the mixer/sampler section. This consists of $\phi_{m2}$ and $\phi_{m2a}$, x(t),
and the sampling capacitor $C_{r1}$. Notice that $C_{r1}$ serves both to implement the loop filter
and the mixer section. $\phi_{m2}$ and $\phi_{m2a}$ are the LO/sampling clocks. In general, the LO
signal $\phi_{m2}$ can have the same frequency as the sampling clock signal $\phi_2$. However,
as discussed in Chapter 5, the sampling distortion introduced by the mixer is
dependent on the fall time of $\phi_{m2a}$ and $\phi_{m2}$. Hence special buffers are introduced
to generate these waveforms to reduce the fall time.

The G of the 1-bit ADC can be shown to be approximately $C_1 \times C_2/(C_{r1} \times C_{r2})$.
By controlling the ratio of these capacitors, a large G is realized and is used to
suppress quantization noise, as evident in (6.27). However, beyond a certain value,
further increase in G will start to have a significant impact on H as well, so that
G can no longer be increased without changing H. At this point the third term in
(6.27) will start to come into the picture and should be considered to determine an
optimal G that minimizes the total noise contribution from the second and third terms.

In a particular implementation of this architecture, where a 10 MHz IF is
digitized [5] in a 1.2 µm CMOS technology, the circuit shown in Figure 6.19
uses a capacitor $C_{r0}$ to introduce a zero at around 750 kHz to compensate for the
phase loss of the loop filter and improve the stability. $C_{r2}$ is made small so it will not
load $C_1$ too much. Considering the loading effect, the two poles are located at 8 kHz and
34 kHz. Switches $S_{m1}$ and $S_{m2}$, used for mixing/sampling, have W/L = 20 µm/1.2 µm
to reduce IM3. IM3 is further reduced by bottom-plate sampling and a fast rise/fall
time in $\phi_{m2a}$ and $\phi_{m2}$. The comparator is implemented with a preamplifier and a
regenerative latch. It has a calculated rms value of the quantization noise, $e_{rms}$, of
41 µV for an equivalent noise bandwidth of 10 MHz and a G of 40 at 10 MHz.
The rms input signal to the comparator is estimated to be around 147 µV.

As noted in Figure 6.19, the passive loop filter has no gain and hence the signal
level at the input of the comparator in this 10 MHz IF digitizer is only on the order
of 147 µV. This is quite small, being only a few times the $e_{rms}$ value of 41 µV. In order to
increase this signal level without the use of an explicit amplifier (which consumes
power), a gain-boost network implemented using switches only (no amplifiers) is
used and is shown in Figure 6.20. Note that the circuit enclosed in the dotted line
replaces the circuit enclosed in the box labeled mixer/sampler and the 1-bit DAC in
Figure 6.19. This circuit consists of N sampling capacitors, each labeled $C_{r1}$.

Fig. 6.20    Gain boosting network

This circuit operates as follows. During $\phi_2$ (sampling phase), all N sampling
capacitors $C_{r1}$ are connected in parallel. The bottom plates are all connected to x(t).
Depending on whether the output code y(n) is 1 or 0, the top plates are all connected
to either $V_{ref-}$ or $V_{ref+}$, respectively, thereby realizing both the 1-bit D/A
conversion and subtraction. Hence all N capacitors are charged to either $x(t) - V_{ref-}$ or
$x(t) - V_{ref+}$. Then during $\phi_1$ (charge-transfer phase), the N sampling capacitors are
reconfigured in a series connection with an equivalent value of $C_{r1}/N$.
This equivalent capacitor is connected between ground and the capacitor $C_1$ and
hence $C_1$ is charged to $N \times (x(t) - V_{ref-})$ or $N \times (x(t) - V_{ref+})$. Thus a voltage gain
of N is achieved. Ideally the network has a transfer function:


C,-i 


(%^)' 


(6.28) 


It  can  be  shown  easily  that  the  DC  gain  is  N  and  the  pole  is  at 


Bottom-plate sampling is used in both phases to reduce charge injection error.
Analysis shows that the thermal noise of the network is the same as that of a switch
connected to the equivalent series capacitance. Even though it is difficult for the
network to achieve high gain due to parasitic capacitance, a moderate gain of 3 is
easily obtained in the design. This means that our modulator can tolerate an $e_{rms}$
that is three times larger for the same DR. Assuming that this noise comes from
thermal noise of the input differential pair of the comparator, this relaxation allows a
significant reduction in the bias current of the differential pair and hence power
consumption of the comparator.
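
The parallel-sample, series-transfer action of the gain-boost network can be sketched with ideal capacitors and charge conservation. This is an idealized model with no parasitic capacitance (which, as noted above, limits the gain in practice); the function names and the capacitor values in the example run are hypothetical.

```python
def stack_in_series(n_caps, c_unit, v_sampled):
    """N equal capacitors, each charged to v_sampled, reconnected in series.

    Returns (open-circuit voltage of the stack, equivalent capacitance).
    """
    return n_caps * v_sampled, c_unit / n_caps


def charge_share(v_stack, c_stack, v_c1, c1):
    """Voltage after connecting the stack across C1 (charge conservation)."""
    return (c_stack * v_stack + c1 * v_c1) / (c_stack + c1)


def settle(n_caps, c_unit, c1, v_sampled, cycles):
    """Repeat sample/transfer with a constant sampled voltage."""
    v_c1 = 0.0
    for _ in range(cycles):
        v_stack, c_stack = stack_in_series(n_caps, c_unit, v_sampled)
        v_c1 = charge_share(v_stack, c_stack, v_c1, c1)
    return v_c1


if __name__ == "__main__":
    # Hypothetical values: three 1 pF sampling capacitors and C1 = 10 pF.
    for cycles in (1, 10, 100, 1000):
        v = settle(3, 1e-12, 10e-12, 0.1, cycles)
        print(f"after {cycles:>4} cycles: V(C1) = {v:.4f} V")
```

In this idealized model the voltage on $C_1$ approaches N times the sampled voltage over many cycles, consistent with a DC gain of N, while each individual transfer moves only a fraction of the way because the series stack is much smaller than $C_1$.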

In summary, the aforementioned low-power modulator has been implemented in
various forms, and three such examples (together with real measured performances)
are summarized in Table 6.1 [5, 7, 12]. Notice that in examples 1 and 2 the modulators
have the input sample-and-hold (S/H) merged with the sampling mixer and therefore
implement the mixer plus baseband ADC as depicted in Figure 6.4 and in Figure P.2(a), Chapter 2.
The third example [7] actually incorporates the merged mixer inside the feedback
loop, as depicted in Figure 6.5, with the modulator still being passive. This example
uses the feedback action inherent in a sigma-delta modulator (which consists
of a quantized feedback loop) to suppress the distortion generated by the mixer.
The details on the nature of this distortion were discussed thoroughly in Chapter 5.

Table 6.1  Performance of sigma-delta modulators using merged sampler/mixer and passive loop
filter

IF        Bandwidth   Resolution   Power     Distortion (IM3)       Vdd     Technology
10 MHz    20 kHz      13 bit       0.25 mW   < -70 dB @ -10 dBm     3.3 V   1.2 µm CMOS
150 MHz   80 kHz      13 bit       12.5 mW   < -65 dB @ -3 dBm      5 V     0.8 µm BiCMOS
400 MHz   40 kHz      12 bit       20 mW     < -85 dB @ -14 dBm     5 V     0.8 µm BiCMOS


6.6     Bandpass  Sigma-Delta  Modulators 

Let  us  now  turn  our  attention  to  bandpass  sigma-delta  modulators.  Examples 
of  using  bandpass  sigma-delta  modulators  are  shown  in  Figures  6.2  and  6.3.  In 
these  figures,  because  the  IF  signal  is  narrowband  in  nature  it  almost  always  makes 
sense  to  implement  the  passband  ADC  using  a  sigma-delta  modulator  approach. 
In  general,  because  a  passband  ADC  operates  at  IF,  we  have  to  use  a  bandpass 
sigma-delta  modulator.  Since  we  perform  the  digitization  at  IF,  the  A/D  conversion 
occurs sooner in the receiver chain. Hence we can get rid of analog narrowband IF
filters (replacing them with digital filters, which are much easier to implement).
As  described  previously,  the  quantization  noise  of  a  low-resolution  quantizer  can 
be  suppressed  selectively  around  specific  ranges  of  frequencies  by  employing 
a  combination  of  oversampling  and  feedback  techniques.  These  techniques  have 
so  far  allowed  us  to  shape  the  quantization  noise  spectrally  while  maintaining  a  flat 
frequency  response  for  a  narrowband  low-pass  input  signal.  The  same  principle 
is  now  extended  to  narrowband  bandpass  signals,  where  the  spectral  shaping  occurs 
around  IF  rather  than  DC.  This  results  in  improved  SNR  and  hence  DR  for 
bandpass  rather  than  low-pass  signals. 

The  advantages  of  conventional  (low-pass)  modulators  over  Nyquist  rate 
converters are equally applicable to bandpass modulators. Inherent linearity and
reduced antialias filter complexity are among the advantages. In addition, the
design  methodology  of  low-pass  sigma-delta  modulators  can  also  be  applied, 
with  some  modifications,  to  the  bandpass  case.  Consequently,  this  section  is 
devoted  only  to  design  aspects  that  are  unique  to  a  bandpass  modulator. 

Similar to a low-pass modulator (shown in Figure 6.8), a bandpass (BP)
sigma-delta modulator can be constructed by embedding a loop filter inside a
feedback loop, as shown in Figure 6.21. The major difference is that the integrator,
with gain k, is now replaced by a resonator with resonating frequency $\omega_0$.
The resonator may be implemented using switched-capacitor techniques, in
which case it consists of two integrators connected back to back. The resonating
frequency, $\omega_0$, is then determined from the unity gain frequencies $\omega_u$ of the two
integrators. Another implementation involves the use of a passive loop filter
structure, similar to the passive low-pass sigma-delta modulator described in
subsection 6.5.7, except the RC filter is replaced by an LC filter so that the filter
can resonate. The loop filter can also be implemented by a continuous-time filter such
as a GmC filter.

Fig. 6.21    A general noise-shaping bandpass modulator

Next  we  show  how  to  start  from  the  low-pass  sigma-delta  modulator  design 
procedure  that  we  are  familiar  with  and  adapt  it  so  we  can  design  a  bandpass 
modulator.  Then  we  highlight  the  limitations  of  this  architecture.  Finally,  we 
consider  a  design  example  for  a  complete  bandpass  modulator  that  is  similar  to 
our  low-pass  sigma-delta  modulator  design  example. 


6.6.1     Comparisons  of  Low-Pass  and  Bandpass  Modulators 


It has been shown that a low-pass sigma-delta converter of order L can be
converted to a bandpass modulator of order 2L, with a center frequency at $f_s/4$.
This transformation has one major advantage: both the stability performance
and the noise properties of the original low-pass prototype that we discussed
previously are preserved in this new topology. Notice that the bandpass sigma-delta
modulator that is used as the passband ADC does not have a mixer in front of it. Hence
the sampler in the modulator has to respond to a high frequency (IF) signal, making
its design more demanding than in the case of low-pass sigma-delta converters
(which are preceded by a mixer). On the other hand, in order to make a fair
comparison, even in the case of a low-pass sigma-delta converter, if we merge its
sampler with a mixer, the sampler faces the same challenge as in the bandpass
sigma-delta modulator case.

When compared with the low-pass sigma-delta modulator, we find that the
high-pass NTF of a low-pass sigma-delta modulator becomes the band-reject NTF in a
bandpass sigma-delta modulator. The quantization noise PSD for a second-order
and a fourth-order bandpass modulator is shown in Figure 6.22. Notice that the
second-order bandpass modulator has essentially the same quantization noise PSD
as a first-order low-pass modulator, except that the null frequency (the frequency
where the quantization noise PSD goes to zero) is moved from DC to $f_0$, the
resonating frequency of the resonator. Hence by selecting the proper resonating
frequency, we are practically selecting the quantization noise null frequency.
Of course, we should choose this to be the same as $f_{if}$. Hence $f_0 = f_{if}$.

Fig. 6.22    N(f) for second- and fourth-order BP modulators

In  this  chapter  we  assume  that  the  BP  modulator  is  implemented  in  the  sampled 
data  domain.  Hence  all  the  poles  of  the  resonator  H(z)  (zeroes  of  the  NTF)  are 
described  using  the  z-plane.  We  further  assume  that  the  poles  are  near  or  on  the  unit 
circle  (but  always  inside  to  maintain  stability). 

Figure 6.23 compares the pole placements of H(z) for a second-order low-pass
modulator [Figure 6.23(a)] and a fourth-order bandpass modulator [Figure 6.23(b)].
Assume for the time being that H(z) only has poles and no zeroes. Figure 6.23(a)
shows that the two poles are at DC. Figure 6.23(b) shows that the four poles are at
±π/2, or at $f_s/4$, where $f_s$ is the sampling frequency. Hence if we set $f_{IF}$ to
exactly $f_s/4$, the signal will be at the quantization noise null, as desired.
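The location of the noise null can be checked numerically. The following is a minimal sketch, assuming idealized NTFs that consist of zeroes only, namely $(1 - z^{-1})^2$ for the second-order low-pass case and $(1 + z^{-2})^2$ for the fourth-order bandpass case; the helper name `ntf_magnitude` and the frequency grid are illustrative choices, not part of the text.

```python
import numpy as np

def ntf_magnitude(num_coeffs, freqs_norm):
    """|NTF| on the unit circle; num_coeffs[k] multiplies z^-k."""
    z_inv = np.exp(-2j * np.pi * np.asarray(freqs_norm))
    return np.abs(np.polynomial.polynomial.polyval(z_inv, num_coeffs))

# Second-order low-pass NTF (assumed ideal): (1 - z^-1)^2 = 1 - 2 z^-1 + z^-2
lp_ntf = [1.0, -2.0, 1.0]
# Fourth-order bandpass NTF (assumed ideal): (1 + z^-2)^2 = 1 + 2 z^-2 + z^-4
bp_ntf = [1.0, 0.0, 2.0, 0.0, 1.0]

freqs = np.linspace(0.0, 0.5, 501)  # f / fs from DC to fs/2
lp_null = freqs[np.argmin(ntf_magnitude(lp_ntf, freqs))]
bp_null = freqs[np.argmin(ntf_magnitude(bp_ntf, freqs))]
print("low-pass null at f/fs =", lp_null)
print("bandpass null at f/fs =", bp_null)
```

Under these assumptions the low-pass null sits at DC and the bandpass null at one quarter of the sampling frequency, which is why $f_{IF}$ is placed at $f_s/4$.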


6.6.2    Low  Pass  Modulator  to  Bandpass  Modulator  Conversion 


It seems that from an architecture point of view the bandpass modulator is essentially
the same as the low-pass modulator. The only difference is that the low-pass loop
filter is replaced by a bandpass loop filter. Hence we first revisit this loop filter
transformation, which is well covered in many standard texts on filter theory.

Fig. 6.23 The pole/zero locations and passbands of the noise transfer functions for (a) low-pass
and (b) bandpass sigma-delta modulators

As an example of the filter transformation, let us start with a low-pass filter H(z)
with poles at DC and apply the transformation $z \to -z^2$. This will map the poles of
H(z) from DC to ±π/2. The $z \to -z^2$ transformation is the simplest transformation.
It has one major advantage: if the low-pass prototype is stable, so is the
bandpass filter derived from it. Another example of a low-pass to bandpass
transformation involves the use of an N-path filter.
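To make the pole mapping concrete, the sketch below applies the substitution $z^{-1} \to -z^{-2}$ to the coefficients of a polynomial in $z^{-1}$ and compares pole radii before and after. The denominator coefficients are an illustrative stable second-order prototype chosen for this example, and the helper names `lp_to_bp` and `pole_radii` are hypothetical.

```python
import numpy as np

def lp_to_bp(coeffs):
    """Apply z^-1 -> -z^-2 to a polynomial in z^-1.

    coeffs[k] multiplies z^-k; the result carries coeffs[k] * (-1)^k at z^-2k.
    """
    out = np.zeros(2 * len(coeffs) - 1)
    out[::2] = [c * (-1) ** k for k, c in enumerate(coeffs)]
    return out

def pole_radii(den_coeffs):
    """Magnitudes of the roots of a denominator given in powers of z^-1."""
    # Multiplying by z^N turns a0 + a1 z^-1 + ... into a0 z^N + a1 z^(N-1) + ...
    return np.abs(np.roots(den_coeffs))

integrator_den = [1.0, -1.0]              # 1 - z^-1: pole at DC (z = 1)
resonator_den = lp_to_bp(integrator_den)  # 1 + z^-2: poles at z = +j and -j

lp_den = [4.0, -6.0, 3.0]                 # illustrative stable prototype denominator
bp_den = lp_to_bp(lp_den)                 # 4 + 6 z^-2 + 3 z^-4

print("resonator pole angles:", np.angle(np.roots(resonator_den)))
print("low-pass pole radii:", pole_radii(lp_den))
print("bandpass pole radii:", pole_radii(bp_den))
```

Each bandpass pole radius is the square root of a low-pass pole radius, so poles that start inside the unit circle stay inside it, which is the stability property noted above.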

An example of such an N-path filter, with N = 3, is shown in Figure 6.24(a), and
its clocking waveform is shown in Figure 6.24(b) [11]. The clocking waveform is an
N-phase clock [N = 3 in Figure 6.24(b)] with the master clock φ running at N times
the clock rate of the subphases. We label the master clock frequency as $f_s$.
Essentially, the frequency response of the N-path filter is simply that of the LPF
together with its images centered at $f_s/N, 2f_s/N, \ldots, f_s(N-1)/N$. These images
provide the bandpass filter response that we are interested in, so we just have to
select the proper image. The use of an N-path filter has the advantage that each
path filter has more time to settle [(N−1)T more time]. However, path mismatch can
be a problem. If this problem is not taken care of, it may introduce a mirror image
of the input signal at a mirror location centered around $f_s/4$ [6]. This path
mismatch can come from mismatch in the DC gain and settling behavior of the op-amps
used to realize the switched-capacitor integrators, which are in turn used to
implement the separate path LPFs. The impact of finite op-amp DC gain and settling
error on the LPF transfer function is exactly the same as described in Section 6.5.
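The image locations can be illustrated with an idealized model. The sketch below assumes perfectly matched paths, so that the overall response is the path response evaluated at $z^N$, and it uses a first-order low-pass path filter with a pole at `a = 0.9` purely as a hypothetical example; neither the filter nor the function names come from the text.

```python
import numpy as np

def lpf_response(z_inv, a=0.9):
    """First-order low-pass path filter (1 - a) / (1 - a z^-1); hypothetical example."""
    return (1.0 - a) / (1.0 - a * z_inv)

def n_path_response(freq_norm, n_paths, a=0.9):
    """Response of an ideal, perfectly matched N-path filter at f / fs."""
    z_inv = np.exp(-2j * np.pi * freq_norm)
    # With matched paths the path filter effectively sees z^N instead of z.
    return lpf_response(z_inv ** n_paths, a)

N = 3
image_centers = [k / N for k in range(1, N)]  # fs/N, 2fs/N, ..., (N-1)fs/N
image_gains = [abs(n_path_response(f, N)) for f in image_centers]
midpoint_gain = abs(n_path_response(1.0 / (2 * N), N))

print("image centers (f/fs):", image_centers)
print("gain at image centers:", image_gains)
print("gain halfway between DC and the first image:", midpoint_gain)
```

In this idealized model each image reproduces the DC gain of the path filter. Path mismatch, which the model deliberately ignores, is what produces the mirror image discussed above.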

A  final  example  of  filter  transformation  consists  of  the  use  of  generalized 
second-order  low-pass-to-passband  transformations.  This  allows  us  to  have  full 
control  over  the  passband  location.  However,  a  stable  low-pass  prototype  does 
not  guarantee  that  the  transformed  bandpass  filter  is  also  stable. 

Fig. 6.24 (a) An example of N-path filter, N = 3. (b) Clock signal for N-path filter, N = 3

Next we extend the quantization noise analysis for the LP modulator to the BP
modulator. We can do this by repeating the analysis in subsection 6.4.2, whereby the
1-bit ADC is again replaced by an additive white noise model. Again the PSD of the
quantization noise is assumed to be white with a power of $\Delta^2/12$. Using this linear
analysis, the NTF has L/2 zeroes at $\omega_0$, where L is the order of the BP modulator.
The result is that for every doubling of the OSR the DR or SNR increases by 3L + 3
dB. This is a little better than half of the improvement rate for the low-pass
modulator, which is 6L + 3 dB/octave [13]. Hence we modify (6.11) by replacing
the (2L + 1) factor with an (L + 1) factor and develop a corresponding DR formula for
a BP modulator. The resulting formula, with DR expressed in decibels, is as follows:


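$$DR|_{dB} = 10\log\left[\frac{3}{2}\,\frac{L+1}{\pi^{L}}\,(OSR)^{(L+1)}\right] \tag{6.29}$$

Simplifying we have:

$$DR|_{dB} = 10\log\left[\frac{3}{2}\,\frac{L+1}{\pi^{L}}\right] + 10(L+1)\log[OSR] \tag{6.30}$$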
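The per-octave improvement quoted above follows directly from the OSR term of the DR formula. As a short check, assuming the DR expression consists of $10(L+1)\log[OSR]$ plus a term that does not depend on the OSR, doubling the OSR gives:

```latex
\begin{aligned}
DR|_{dB}(2\,OSR) - DR|_{dB}(OSR)
  &= 10(L+1)\log_{10}(2\,OSR) - 10(L+1)\log_{10}(OSR) \\
  &= 10(L+1)\log_{10} 2 \\
  &\approx 3.01\,(L+1)\ \text{dB} \approx (3L+3)\ \text{dB}
\end{aligned}
```

For a fourth-order BP modulator (L = 4) this is about 15 dB per octave of OSR.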

6.7     Implementation of Bandpass Sigma-Delta Modulators

A bandpass modulator has the same components as a low-pass modulator. The first
component is the loop filter. To implement this, let us assume that the filter
transformation is the $z^{-1} \to -z^{-2}$ transformation. The simplest example is when
the initial low-pass prototype is an integrator whose transfer function H(z) is
$z^{-1}/(1 - z^{-1})$. With the transformation, H(z) becomes $-z^{-2}/(1 + z^{-2})$.
This has poles at ±π/2 on the unit circle, similar to the case described in
Figure 6.23(b), and therefore H(z) is a resonator.

6.7.1    Review  of  Switch-Capacitor-Based  Resonators 

We will now discuss how to implement a resonator. Let us assume that we implement
the resonator using the switched-capacitor integrator shown in Figure 6.14. The
resulting circuit is shown in Figure 6.25(a), where the unity gain frequency of
each integrator is also shown and is related to the unity gain frequency of the
original integrator, denoted as $\omega_1$.

By going through the loop, we can see that the loop gain of the resonator is set by
the product of the two unity gain frequencies, $\omega_1 \times (\omega_0^2/\omega_1) = \omega_0^2$.
Now what is the resonating frequency of this resonator? First we show that the
frequency at which the loop gain around the resonator is unity is $\omega_0$. To show
that, recognize that for integrator 1 (unity gain frequency $\omega_1$), the gain at
$\omega$ is, by definition, $\omega_1/\omega$. Hence at $\omega_0$ the gain is $\omega_1/\omega_0$. For
integrator 2 (unity gain frequency $\omega_0^2/\omega_1$), the gain at $\omega$ is
$(\omega_0^2/\omega_1)/\omega$. Hence at $\omega_0$ the gain is $(\omega_0^2/\omega_1)/\omega_0 = \omega_0/\omega_1$.
The loop gain at $\omega_0$ is simply the product of the two integrator gains at $\omega_0$,
which is $(\omega_1/\omega_0) \times (\omega_0/\omega_1) = 1$. Hence the loop gain around the
resonator at $\omega_0$ is indeed unity. Second, from the definition of a resonator, the
frequency at which the loop gain is 1 is its resonating frequency. Applying this
definition to our first conclusion, we can now conclude that $\omega_0$ is indeed the
resonating frequency of the resonator. It can further be shown that the Q (quality
factor) of this resonator is $\omega_0/\omega_1$.
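The unity loop gain argument can be sketched numerically. The example below takes the gain magnitude of an ideal integrator with unity gain frequency $\omega_u$ to be $\omega_u/\omega$ and multiplies the two integrator gains around the loop. The frequencies are hypothetical values chosen only for illustration, and the function names are not from the text.

```python
import math

def integrator_gain(omega, omega_unity):
    """Gain magnitude of an ideal integrator with unity gain frequency omega_unity."""
    return omega_unity / omega

def resonator_loop_gain(omega, omega0, omega1):
    """Product of the two integrator gains around the resonator loop."""
    gain1 = integrator_gain(omega, omega1)                # integrator 1: omega1
    gain2 = integrator_gain(omega, omega0 ** 2 / omega1)  # integrator 2: omega0^2 / omega1
    return gain1 * gain2

omega0 = 2 * math.pi * 20e6  # hypothetical resonating frequency, rad/s
omega1 = 2 * math.pi * 5e6   # hypothetical unity gain frequency of integrator 1, rad/s

gain_at_resonance = resonator_loop_gain(omega0, omega0, omega1)
gain_at_twice_resonance = resonator_loop_gain(2 * omega0, omega0, omega1)
print(gain_at_resonance, gain_at_twice_resonance)
```

The loop gain magnitude equals $(\omega_0/\omega)^2$, so it is unity at $\omega_0$ regardless of how $\omega_1$ is chosen; $\omega_1$ affects only Q and the component spread.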

The aforementioned resonator works fine but has a bad component spread.
This can be taken care of by scaling, so Figure 6.25(a) becomes Figure 6.25(b).
Here the two integrators have identical unity gain frequency, set equal to $\omega_0$, but
each is preceded by a different attenuator. If the resonator implemented in
Figure 6.25(b) (called a biquadratic structure) uses a type 1 integrator, the overall
transfer function becomes

$$H(z) = G\,\frac{z^{-2}}{1 + z^{-2}} \tag{6.31}$$

Here G is the gain of the resonator [6]. Notice that this function has zeroes in
addition to poles. Hence if we use this to implement the BP modulator, the resulting
NTF contains poles.

Fig. 6.25 (a) Resonator implemented using two integrators. (b) Resonator with scaled integrators.
Note: Int = integrator

6.7.2    Non-Ideal  Resonator 


Since we implement the resonator using a biquadratic structure, the impact of op-amp
settling error and finite gain will manifest as a shift in $f_0$. This leads to excess
quantization noise, as shown in Figure 6.26. The figure shows the N(f) of a BP
modulator using a resonator implemented with ideal components (left) and nonideal
components (right). Notice that with nonideal components, the input signal is sitting at
a frequency where there is substantial quantization noise and the DR is reduced. It is
precisely this phenomenon that places constraints on the maximum nonidealities
that can be tolerated. As seen, the increase in quantization noise, and hence the
reduction in DR, is a function of $\Delta f_0$, the shift in the resonator frequency. Next we
show the relationship between this shift in frequency and nonidealities due to the
op-amp, namely the finite op-amp gain and bandwidth. This will allow us to derive a
rule of thumb on the allowable op-amp gain and bandwidth that still yield an
acceptable DR.

Fig. 6.26 N(f) with ideal and nonideal components


6.7.2.1     Finite  Op-Amp  Gain 

First, we assume that the switched-capacitor integrator of the resonator is
implemented as shown in Figure 6.14, with the same switches and clocking
arrangement. In the presence of finite op-amp gain, A, the switched-capacitor
integrator implementation has the transfer function given by (6.12). In order to
separate the impact of op-amp gain on integrator gain and integrator pole location,
we rewrite (6.12) in the following form:

$$H(z) = \frac{C_1}{C_2}\,(1-m)\,\frac{z^{-1/2}}{1-(1-p)z^{-1}} \tag{6.32}$$

Here parameters m and p highlight the separate effects on integrator gain and pole
location, respectively. They are related to op-amp gain A as

$$m = \frac{1}{A}\left(1 + \frac{C_1}{C_2}\right) \tag{6.33}$$

and

$$p = \frac{1}{A}\,\frac{C_1}{C_2} \tag{6.34}$$

Now let us turn to the switched-capacitor resonator. It has two identical integrators.
Let us relabel these integrators' capacitors as $C_s$ and $C_i$. Using these new notations,
(6.32) through (6.34) are rewritten as

$$H(z) = \frac{C_s}{C_i}\,(1-m)\,\frac{z^{-1/2}}{1-(1-p)z^{-1}} \tag{6.35}$$

$$m = \frac{1}{A}\left(1 + \frac{C_s}{C_i}\right) \tag{6.36}$$

$$p = \frac{1}{A}\,\frac{C_s}{C_i} \tag{6.37}$$

Equations (6.35) through (6.37) are applicable to both integrators of the resonator.
We can now find the resonator transfer function by applying (6.35) through
(6.37) to each of the integrators in Figure 6.25. This expression turns out to be rather
complicated. For illustration purposes we neglect all second-order terms in this
expression and set $C_s/C_i = 1$. The simplified expression is

$$H(z) = G(1 - m_1 - m_2)\,\frac{z^{-2}}{1 + (p_1 + p_2 - 2m_1 - 2m_2)z^{-1} + (1 - p_1 - p_2)z^{-2}} \tag{6.38}$$

Here G is the gain of the resonator. Variables $m_1$, $m_2$, $p_1$, and $p_2$ are defined in
(6.36) and (6.37) for integrators 1 and 2. Notice that if A becomes ∞, all the m's and p's
become zero and (6.38) reverts back to

$$H(z) = G\,\frac{z^{-2}}{1 + z^{-2}} \tag{6.39}$$

This is the transfer function of the resonator implemented with ideal
components, as stated in (6.31). If we assume that $m_1, p_1, m_2, p_2 \ll 1$, the
fractional deviation in the center frequency, $f_0$, of the resonator is

$$\frac{\Delta f_0}{f_0} \approx \frac{p_1 + p_2 - 2m_1 - 2m_2}{\pi} \tag{6.40}$$

Notice that $p_1$, $p_2$, $m_1$, and $m_2$ partially cancel one another, so the overall
fractional change remains relatively small. Furthermore, it can be shown that this
fractional change is not very sensitive to the ratio $C_s/C_i$. Hence we can use (6.40) to
predict the fractional change of $f_0$ for a general resonator.

While the tolerable decrease in DR is application dependent, a good rule of
thumb is around 5 dB. The decrease in DR due to a fractional shift of $f_0$ for a BP
modulator is a function of its order. The exact decrease in DR can usually be
quantified by simulation only. From simulation, for the DR of a fourth-order BP
modulator to suffer less than 5 dB reduction, $f_0$ should shift by less than 0.2% [6].
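The link between op-amp gain and the 0.2% limit can be sketched as follows. The code assumes the forms $m = (1 + C_s/C_i)/A$ and $p = (C_s/C_i)/A$ for the gain and pole error terms, identical op-amps in both integrators, and a fractional shift of magnitude $|p_1 + p_2 - 2m_1 - 2m_2|/\pi$; the function names are hypothetical.

```python
import math

def gain_error_terms(op_amp_gain, cs_over_ci=1.0):
    """Gain error m and pole error p of one integrator (assumed forms)."""
    m = (1.0 + cs_over_ci) / op_amp_gain
    p = cs_over_ci / op_amp_gain
    return m, p

def fractional_f0_shift(op_amp_gain, cs_over_ci=1.0):
    """Magnitude of the fractional f0 shift when both integrators are identical."""
    m, p = gain_error_terms(op_amp_gain, cs_over_ci)
    return abs(2 * p - 4 * m) / math.pi

def min_gain_for_shift(max_shift):
    """Smallest gain meeting max_shift for Cs/Ci = 1, where the shift is 6 / (pi * A)."""
    return 6.0 / (math.pi * max_shift)

required_gain = min_gain_for_shift(0.002)       # 0.2% limit for a fourth-order modulator
shift_at_60_db = fractional_f0_shift(1000.0)    # shift with A = 1000 (60 dB)
print(round(required_gain, 1), shift_at_60_db)
```

Under these assumptions a gain of roughly 955 meets the 0.2% limit, so rounding up to 1000 (60 dB) leaves a small margin.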


6.7.2.2     Finite  Op-Amp  Speed 

As is the case with low-pass sigma-delta converters, it is important that the outputs of
resonators in a BP modulator settle to the desired accuracy. Since an individual
resonator is implemented using integrators as shown in Figures 6.25(a) and 6.25(b),
and integrators are in turn implemented using op-amps, the finite op-amp speed
will affect the resonator's transfer function. Assuming that the switched-capacitor
integrator used to implement the resonator settles linearly with a single-pole
response, incomplete settling manifests itself as an integrator gain error. Hence, as
with finite op-amp gain, the settling error will also result in a shift of the center
frequency of the resonator. This shift of $f_0$ introduces extra quantization noise
into the passband and hence compromises the dynamic range.
The effect of linear op-amp settling on the resonator can be determined by modeling
each of the two switched-capacitor integrators using (6.15) again. We simplify
(6.15) by approximating the factor $1 - \varepsilon + z^{-1}\varepsilon C_2/(C_1 + C_2)$ in (6.15) as
$1 - \varepsilon$, as we assume that the third term in the factor is smaller than the second
term and can be neglected. Then we rename $\varepsilon$ as g. Finally, we relabel these
integrators' capacitors as $C_s$ and $C_i$. Then H(z) becomes:

$$H(z) = \frac{(C_s/C_i)\,z^{-1/2}\,(1-g)}{1 - z^{-1}} \tag{6.41}$$

Next let us rewrite (6.16) using these new notations:

$$g = e^{-T/2\tau} \tag{6.42}$$

Here T is the sampling period and we assume a two-phase clock with a 50% duty
cycle for each phase. $\tau$ is given by

$$\tau = \frac{1}{2\pi f_t}\,\frac{C_s + C_i}{C_i} \tag{6.43}$$

Here $f_t$ is the unity gain bandwidth of the op-amp. Notice that in (6.43) we no
longer assume that the feedback factor is 1, and so $\tau$ is modified from $1/(2\pi f_t)$
by the extra factor $(C_s + C_i)/C_i$.

We now connect the two integrators back to back, each having the transfer
function given in (6.41). We further assume that $C_s/C_i = 1$. When second-order
terms are neglected, the simplified resonator transfer function is found to be

$$H(z) = G(1 - g_1 - g_2)\,\frac{z^{-2}}{1 - (2g_1 + 2g_2)z^{-1} + z^{-2}} \tag{6.44}$$

Here G is the gain of the resonator, and $g_1$, $g_2$ are defined in (6.42) for integrators
1 and 2. Consequently, the fractional shift in $f_0$ due to incomplete settling is
approximately

$$\frac{\Delta f_0}{f_0} \approx \frac{2g_1 + 2g_2}{\pi} \tag{6.45}$$
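The chain from an allowed frequency shift to a required op-amp bandwidth can be sketched as below. The code assumes a fractional shift of $(2g_1 + 2g_2)/\pi$ with $g_1 = g_2 = g$, a settling error $g = e^{-T/2\tau}$ (half a clock period available for settling), and $\tau = (1 + C_s/C_i)/(2\pi f_t)$. The 80 MHz sampling frequency and the 0.2% limit are example inputs, and the function names are hypothetical.

```python
import math

def settling_error_for_shift(max_shift):
    """Settling error g allowed by a given shift, with g1 = g2 = g: shift = 4 g / pi."""
    return max_shift * math.pi / 4.0

def time_constant(settling_error, sample_period):
    """Closed-loop time constant from g = exp(-T / (2 tau))."""
    return sample_period / (2.0 * math.log(1.0 / settling_error))

def unity_gain_bandwidth(tau, cs_over_ci=1.0):
    """Op-amp unity gain bandwidth from tau = (1 + Cs/Ci) / (2 pi f_t)."""
    return (1.0 + cs_over_ci) / (2.0 * math.pi * tau)

fs = 80e6                                  # example sampling frequency, Hz
g = settling_error_for_shift(0.002)        # allowed settling error for a 0.2% shift
tau = time_constant(g, 1.0 / fs)           # seconds
f_t = unity_gain_bandwidth(tau)            # Hz, for Cs = Ci
print(g, tau, f_t)
```

With these inputs the settling error comes out near 0.0016, the time constant just under 1 ns, and the unity gain bandwidth a little over four times the sampling frequency.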


6.7.2.3     Resonator  Noise 

The resonator noise effect on capacitor sizing is similar to the integrator noise case,
with the input sampling capacitor having to be made the largest. Subsequent noise
sources (after the first integrator in the low-pass case and after the first resonator in
the bandpass case) will see their noise contribution suppressed by noise shaping,
and hence the size of the associated capacitors can be reduced substantially.

For the low-pass modulator we stated that 1/f noise can be a problem. In the
present case this noise, which is generated mostly from the gates of the MOS
transistors, lies below the band of interest of the bandpass modulator and
should be of little concern.


6.7.3    Design  Procedure 

We  now  examine  the  design  procedure  of  a  BP  modulator  in  a  manner  similar  to 
what  we  did  in  subsection  6.5.6  for  a  low-pass  modulator.  The  design  steps  are  as 
follows: 

1. From Chapter 2, find the specifications for the ADC in terms of DR, full-scale
voltage ($V_{FS}$), bandwidth of digitization ($f_{bw}$), and the IF.

2. Determine $f_s$ from $f_s = 4 \times f_{IF}$ and the OSR from $OSR = f_s/(2 f_{bw})$. Use the
following formula [repeated from (6.30)] to find the order L that satisfies the
required DR for the given OSR:

$$DR|_{dB} = 10\log\left[\frac{3}{2}\,\frac{L+1}{\pi^{L}}\right] + 10(L+1)\log[OSR]$$

Notice that this step is different from the low-pass case.

3. Select a low-pass modulator whose order is L/2 and that is stable. Determine its
NTF and convert this NTF to the NTF of a bandpass modulator using the
$z \to -z^2$ transformation.



4.  Determine  a  block-level  diagram  implementation  of  the  BP  modulator  that  has 
the  NTF  determined  in  step  3.  Each  block  in  the  diagram  should  consist  of  a 
resonator  only. 

5. Determine the minimum requirement for each block of the BP modulator.
The governing equations include (6.40) and (6.45), from which the op-amp
gain and bandwidth that achieve an acceptable $\Delta f_0/f_0$, and hence an acceptable
DR, can be determined. The resonator noise dictates the capacitor size through
the use of (6.18), and the comparator has to work at four times the IF.

Design  Example  6.2. 

Step 1: In this design example, the same specifications are adopted as in
Design Example 6.1. Therefore, following step 1 of Design Example 6.1, $f_{bw}$ is
selected to be 200 kHz and the DR is selected to be 72 dB. $f_{IF}$ is 20 MHz.

Step 2: Unlike in a low-pass sigma-delta modulator, $f_s$ is larger than the IF:
$f_s = 4 \times f_{IF} = 80$ MHz. Hence the OSR is

$$OSR = \frac{f_s}{2 f_{bw}} = \frac{80\ \text{MHz}}{400\ \text{kHz}} = 200 \tag{6.46}$$

To see whether the DR requirement is satisfied, let us take (6.30), substitute in
OSR = 200, and try L = 2. The resulting DR is

$$DR|_{dB} = 10\log\left[\frac{3}{2}\,\frac{2+1}{\pi^{2}}\right] + 10(2+1)\log[200] = 63\ \text{dB} \tag{6.47}$$

This is not sufficient to satisfy the required DR of 72 dB. Next we try L = 4 and
we have

$$DR|_{dB} = 10\log\left[\frac{3}{2}\,\frac{4+1}{\pi^{4}}\right] + 10(4+1)\log[200] = 101\ \text{dB}$$

and the required DR is satisfied.


Step 3: From step 2, L = 4, hence L/2 = 2. Therefore we need to select a second-order
low-pass modulator that is stable. We select the low-pass modulator of
Design Example 6.1. The NTF for this low-pass modulator is given in (6.26) and is
rewritten here:

$$NTF = \frac{4(1 - z^{-1})^2}{3z^{-2} - 6z^{-1} + 4} \tag{6.48}$$

Using the $z \to -z^2$ transformation, we obtain the NTF for the fourth-order
BP modulator:

$$NTF = \frac{4(1 + z^{-2})^2}{3z^{-4} + 6z^{-2} + 4} \tag{6.49}$$


Step 4: To determine the block diagram of a BP modulator that has the NTF
described in (6.49), we start with the block diagram of an LP modulator that has the
NTF described in (6.48). This was given in Figure 6.17(b). We redraw it in
Figure 6.27(a), where the integrators are redrawn with their transfer functions
explicitly shown. Now we replace every z in Figure 6.27(a) with $-z^2$. Hence each
integrator whose transfer function is $z^{-1}/(1 - z^{-1})$ becomes a block whose transfer
function is $-z^{-2}/(1 + z^{-2})$. This represents a resonator. Next we want to achieve
proper scaling on the bandpass sigma-delta modulator while maintaining the same
NTF. Therefore, the gains in front of the integrators in Figure 6.27(a) are changed.
The final block diagram of the BP modulator is shown in Figure 6.27(b) [6].

Fig. 6.27 (a) LP modulator prototype. (b) The final realization of the fourth-order BP modulator

Step 5: Determine the requirements of each block. Let us assume that for the present
application an acceptable DR reduction is 5 dB. Since from step 2 we obtain L = 4,
from subsection 6.7.2.1 we know that for L = 4 and a DR reduction of 5 dB, an
acceptable $\Delta f_0/f_0 = 0.2\%$. Let us set $C_s/C_i = 1$ in all the integrators.

(a) Op-amp gain: Assume that the op-amps for all the integrators have the same gain A.
Substituting $C_s/C_i = 1$ in (6.36) and (6.37), we find m = 2/A and p = 1/A.
Substituting these and $\Delta f_0/f_0 = 0.2\%$ in (6.40), and taking the magnitude of
the shift, we have

$$0.2\% = \frac{|2p - 4m|}{\pi} = \frac{6}{\pi A} \tag{6.50}$$

Solving, we have A = 954.9. Rounding up we make A = 1000 or 60 dB.
To be exact, (6.40) is derived with $C_{s1}/C_{i1}$ and $C_{s2}/C_{i2}$ both set to 1, where
$C_{s1}$, $C_{s2}$ are the sampling capacitors for integrators 1 and 2 in the resonator
described in Figure 6.25(b). Similarly, $C_{i1}$, $C_{i2}$ are the integrating capacitors for
integrators 1 and 2 in Figure 6.25(b). In the present case, since the gains of
the resonators are −0.4 and 0.5, respectively, (6.40) may have to be modified.
(b) Op-amp bandwidth: We assume that the op-amps in all integrators have the same
unity gain bandwidth and hence the same settling error g. Since $C_s/C_i = 1$,
(6.45) applies. Substituting $\Delta f_0/f_0 = 0.2\%$ in (6.45), we have

$$0.2\% = \frac{2g + 2g}{\pi} \tag{6.51}$$

Solving, g = 0.0016. Next, with $f_s$ = 80 MHz (as determined in step 2), T = 12.5
ns. We can then substitute g and T into (6.42), and we find $\tau$ = 0.97 ns. This is the
closed-loop time constant of the integrator. Substituting this value in (6.43), with
$C_s = C_i$, we find $f_t$ to be 328 MHz. Notice that $f_t$ is about 4 times $f_s$. Also, this is
about 3 to 4 times the $f_t$ required for the low-pass modulator case.

(c) Resonator noise: Let us return to (6.18), which relates the DR to $V_{FS}$,
$\sqrt{OSR}$, and the capacitor size. Now we can substitute the required DR, OSR, and
$V_{FS}$ obtained from previous steps, which equal 72 dB, 200, and 1 V, respectively,
into this equation. Upon substitution we find that a $C_i$ of 0.125 pF is more than
enough to meet the requirement. From Figure 6.27(b) we see that the gain $k_1$ of
the first resonator is −0.4. Therefore, $C_{s1}/C_{i1} = 0.4$, or
$C_{s1} = C_{i1} \times 0.4 = 0.05$ pF. The negative sign can be realized by proper phasing
of the clock. We can design the capacitors of the second integrator similarly, but
with a different gain. Finally, from Figure 6.27(b) the feedback factor of the
modulator is 1, and hence $C_{ref} = C_s$ for both resonators.

(d) Comparator requirement: In this modulator the comparator has to work at a very
high speed (at least four times the IF). This means that the output code y(n) has to
be made available in less than half of the clock period, or 6.25 ns.
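The system-level numbers of this design example (steps 1 and 2, and items (c) and (d) of step 5) can be reproduced with a short script. This is a sketch under stated assumptions: the DR expression is taken as $10\log[(3/2)(L+1)/\pi^L] + 10(L+1)\log[OSR]$, only even orders are tried because a BP modulator of order L is derived from a low-pass prototype of order L/2, and the function names are hypothetical. Evaluating the expression as written gives DR values within a few dB of the figures quoted in step 2 and leads to the same choice of order.

```python
import math

def dr_bandpass_db(order, osr):
    """DR in dB of a bandpass modulator of the given order (assumed formula)."""
    return (10 * math.log10(1.5 * (order + 1) / math.pi ** order)
            + 10 * (order + 1) * math.log10(osr))

def smallest_even_order(dr_target_db, osr, max_order=12):
    """First even order whose DR meets the target."""
    for order in range(2, max_order + 1, 2):
        if dr_bandpass_db(order, osr) >= dr_target_db:
            return order
    raise ValueError("no order up to max_order meets the DR target")

f_if = 20e6            # IF, Hz
f_bw = 200e3           # bandwidth of digitization, Hz
dr_target_db = 72.0    # required DR, dB

fs = 4 * f_if                                    # sampling frequency: 80 MHz
osr = fs / (2 * f_bw)                            # oversampling ratio
order = smallest_even_order(dr_target_db, osr)   # bandpass modulator order L
comparator_time = 0.5 / fs                       # half a clock period, seconds
cs1 = 0.4 * 0.125e-12                            # |k1| * Ci for the first resonator, F

print(fs, osr, order, comparator_time, cs1)
```

The script selects L = 4, a comparator decision time of 6.25 ns, and a first-resonator sampling capacitor of 0.05 pF, in line with the values derived above.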


6.7.4    Bandpass  Versus  Low-Pass  Modulators 

We now compare the two architectures covered in Sections 6.4 and 6.6. On a system
level they achieve similar goals, with the main distinction being whether mixing is
performed before or after the A/D conversion. In practical terms, bandpass
modulators (passband ADC in Figure 6.3) have to sample faster input signals than
baseband modulators (baseband ADC in Figure 6.4 and Figure P. 9(a), Chapter 2).
Incidentally, the system in Figure 6.4 and Figure P. 2(a), Chapter 2, is also called a
zero IF system, since the analog signal is mixed down to DC (or zero IF) before
being demodulated. A/D converters for zero IF systems need to have small 1/f noise.
The key observation is that a BP modulator needs a highly linear (low distortion)
input sample-and-hold (SAH), whereas a zero IF system needs a highly linear mixer.

If we want to compare the two modulators in terms of hardware
requirements, Design Examples 6.1 and 6.2 give a good indication. For the
op-amp gain requirement, in a bandpass modulator the NTF null frequency is not
at DC and so is very sensitive to the op-amp gain. Hence the DR is very sensitive to
op-amp gain. On the other hand, in a low-pass modulator the NTF null frequency is
at DC, independent of op-amp gain. Finite op-amp gain only changes the noise floor
of the NTF and has a much less dramatic impact on the DR. For the op-amp
and comparator speed requirement, we should note that as $f_{IF}$ increases,
the speed requirement for a low-pass modulator will become less than that of the
bandpass modulator. The main reason is that in a bandpass modulator $f_s$ is
higher than $f_{IF}$ (typically four times), whereas in a low-pass modulator $f_s$
is dictated by $f_{bw}$, which is fixed for a given standard and is independent of $f_{IF}$.



Table 6.2 Comparisons of low-pass and bandpass modulators' hardware requirements

Modulator type    Op-amp gain         Op-amp bandwidth    Comparator speed
Low pass          Moderate            Moderate            Moderate
Bandpass          Moderate to high    High                High

For DECT, GSM, and other standards, once $f_{IF}$ goes into the megahertz range, $f_s$ for a
bandpass modulator becomes larger than that of a low-pass modulator. This
difference in $f_s$ explains the difference in operating speed, and hence in the
requirements for op-amps and comparators, in the two cases. The preceding
comparisons are summarized in Table 6.2.


6.8     I/Q  mismatch  in  Mixer  and  A/D  Converters 
6.8.1     Origin  of  Mismatch  and  Its  Impact 

In a wireless receiver, since the information is modulated using two paths,
quadrature sampling is required to separate the data in these two paths, as shown
in the IF digitizer of Figure 6.4 or Figure 6.5. Following this, digital signal
processing is performed and the demodulated signal $S_{DQPSK}$ is generated. This
intermediate frequency (IF) digitizer can be adopted in a receiver using differential
quadrature phase shift keying (DQPSK) (for instance, receivers for PACS, Tetra,
IS-136, PDC, and ETSI-DAB use DQPSK [15, 16]). Figure 6.4 or Figure 6.5 is redrawn
in Figure 6.28a, with the I and Q paths explicitly shown. Although ideally there is no
image leakage or crosstalk between the I and Q paths, in practice finite imbalance
between the two paths does result in leakage and crosstalk [16]. This imbalance is
mainly due to imperfect matching of passive and active components in the two paths,
as well as LO I/Q imbalance.

The main problems with I/Q path mismatch in IF digitizers are [16, 17]:

1) An undesired image results in a leakage component that occurs in the signal
channel.

In low-IF systems, I/Q mismatch is important since the image channel is close
to the frequency of the desired channel and often cannot be removed with
prefiltering. Typically the image channel may be 40 dB higher than the desired
channel. With I/Q mismatch, the image rejection is limited to about 40 dB. Thus
the ratio of desired signal to undesired signal is around 0 dB. If the image channel
interferer looks like noise, this means the SNR is around 0 dB. For an acceptable
BER, the SNR typically should be larger than 10 dB. Consequently, an image
rejection circuit is needed to improve the image rejection by this additional 10 dB.
Referring to Figure 6.28a, the output Y of the quadrature modulator is

$$Y = I + jQ \tag{6.52}$$

Fig. 6.28 (a) IF digitizer in a DQPSK receiver (b) one implementation of part (a), used to explain
source of mismatches (c) gain and phase mismatch in part (a) explicitly shown


Sigma-delta  modulators  are  subject  to  mismatch  errors  in  the  input  and  feedback 
circuits.  To  see  this,  let  us  now  return  to  a  specific  implementation  of  the 
sigma-delta  modulator  architecture  in  Figure  6.5,  which  is  used  to  realize  the  I,  Q 
paths  of  Figure  6.28a.  This  implementation  is  shown  in  Figure  6.28b.  Then  the  input 
and  feedback  circuits  circuits  include  input  resistors  [7],  sampling  mixers  as 
discussed  in  chapter  5,  LO  buffers  to  buffer  VCO,  which  will  be  discussed  in 
chapter  7,  capacitors  and  switches  in  the  loop  filters,  feedback  resistors,  and  1-bit 
feedback  DACs.  It  is  assumed  that  errors  introduced  in  circuits  inside  the  loop  are 
suppressed  by  the  loop  gain  and  consequently  are  less  important.  Thus,  for 
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Table  6.3   Notations  to 
Define  Various  I,  Q 
Components  and  Angles 


Notations 


Definitions 


i,Q,v,\v\,e 
i,q,v,  \v\,e 

Qi„djF(n) 

1 2nd  J  F  symbol  {n), 

QlndJF _symbol{n) 

'transform?    [^transform' 
* transform 

*  process  7    \c process  1 
'process 

Aa,  Ap,  A(jscheme 


Aflh 


A0. 


scheme  \  impair 


Baseband  I,  Q  components  of  transmitted 
symbol,  vector  representation  and  its 
magnitude,  angle  with  x-axis 

Baseband  I,  Q  components  of  received 
symbol,  vector  representation  and  its 
magnitude,  angle  with  x-axis 

I  path  and  Q  path  received  signals  at  2n 
IF  (sampled  and  digitized);  n  is  the 
sampling  index 

A  symbol  period  long  of  hndJF (n), 

QlndJF(ll) 

Baseband  I,  Q  components  of  transformed 

received  symbol  and  the  vector 

representation. 
Baseband  I,  Q  components  of  processed 

received  symbol  and  the  vector 

representation. 
Differential  angle  of  transmitted  dibit, 

received  dibit,  received  dibit  with 

scheme 
Angle  deviation  in  received  symbol  from 

the  bisection  line  of  the  quadrant  in 

which  symbol  is  in,  due  to  mismatch 
Differential  angle  of  received  dibit  under 

impairment  such  as  noise,  received 

dibit  under  impairment  with  scheme 

applied 


AvelTor,  AD error _scheme       ZADh 


Zap,  £±vscheme  impair       zap 


example, a mismatch between the ON-impedance of the switches in the loop filter
is considered less important than a mismatch between the input switches. Matching
between switches can be improved at the expense of chip area and with proper layout.
Problems associated with these matching techniques are parasitic delay and cross-coupling
between the matched switches. The input resistors for the input adder are
outside the loop. Therefore the extra delay introduced does not affect stability, and the
layout of the input resistors can be optimized for matching. On the other hand,
resistors and capacitors in the reconstruction filters, which are in the feedback
paths, introduce extra phase shift through their parasitics, and this could cause
instability. This is especially important as the sampling rate increases, although it
can be alleviated somewhat through subsampling. These mismatches modify
Figure 6.28a to become Figure 6.28c. The mismatch error has been modeled with
two unity gain blocks that have a gain mismatch $\alpha$ and a phase mismatch $\theta$,
resulting in an overall mismatch of $\Delta$. Because of the mismatch, a leakage component
occurs in the image band that can be 40 dB down relative to the signal carrier. In
mathematical terms, the output signal can be approximated by


$$Y = (I + jQ) + \Delta\,(I - jQ) \qquad (6.53)$$

where the term $(I + jQ)$ represents the desired signal spectrum, and the term $\Delta(I - jQ)$
is the error spectrum due to the mismatch. These two spectra are mirror images of each other
about dc. Accordingly, signals and noise at positive frequencies (the signal band)
leak into negative frequencies (the image band).
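
As a rough numerical check of (6.53), the sketch below applies a gain and phase mismatch to the I and Q paths of a single complex tone and measures how much of it lands at the mirror frequency. The function name, the symmetric split of the mismatch between the two paths, and the 1% and 0.01 rad values are illustrative assumptions, not values taken from the text.

```python
import cmath
import math


def image_leakage_db(gain_mismatch, phase_mismatch_rad, n=256, k=16):
    """Image-to-signal ratio (dB) of a complex tone after I/Q mismatch.

    Assumption: the mismatch is split symmetrically between the I and Q paths.
    """
    w = 2 * math.pi * k / n          # tone sits exactly on DFT bin k
    signal = 0j
    image = 0j
    for t in range(n):
        i_path = (1 + gain_mismatch / 2) * math.cos(w * t + phase_mismatch_rad / 2)
        q_path = (1 - gain_mismatch / 2) * math.sin(w * t - phase_mismatch_rad / 2)
        y = complex(i_path, q_path)
        signal += y * cmath.exp(-1j * w * t)   # correlate with +f (signal band)
        image += y * cmath.exp(+1j * w * t)    # correlate with -f (image band)
    return 20 * math.log10(abs(image) / abs(signal))


leak_db = image_leakage_db(gain_mismatch=0.01, phase_mismatch_rad=0.01)
print(f"image leakage: {leak_db:.1f} dB relative to the signal")
```

With these assumed mismatch values the image comes out about 43 dB below the signal, which is in line with the 40 dB figure quoted above.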

2)  Crosstalk  is  created  [16]. 

For ease of explanation we use a QPSK signal (other signals, such as 8-phase
DPSK, are explained in [27]):

$$S_{QPSK} = I\cos\omega_c t - Q\sin\omega_c t \qquad (6.54)$$

where $I$ and $Q$ are the baseband I, Q components of the transmitted symbol, with
vector representation $V$ and magnitude $|V|$.

Table 6.3 summarizes the notations in this section. $\omega_c$ is the carrier frequency.
As discussed above, mismatches are explicitly shown in Figure 6.28c. Upon
quadrature mixing, and following Figure 6.28c, the resulting I, Q path signals
are $(1+\alpha_i)\,[S_{QPSK}\cos(\omega_c t + \theta_i)]$ and $(1+\alpha_q)\,[S_{QPSK}\sin(\omega_c t + \theta_q)]$,
respectively. Collecting only the baseband terms we have:

$$\hat{I} = (1+\alpha_i)\,[I\cos\theta_i + Q\sin\theta_i] \qquad (6.55)$$

$$\hat{Q} = (1+\alpha_q)\,[-I\sin\theta_q + Q\cos\theta_q] \qquad (6.56)$$


The symbol "^" in $\hat{I}$, $\hat{Q}$, $\hat{V}$, $|\hat{V}|$ designates that they are received symbols.
Since $Q$ now appears in the $\hat{I}$ expression and $I$ now appears in the $\hat{Q}$ expression,
there is crosstalk. Similar reasoning explains how crosstalk occurs with a DQPSK
signal, which is expressed as:

$$S_{DQPSK} = \cos(\omega_c t + \theta_k + \pi/4) \qquad (6.57)$$

Here $\theta_k = \theta_{k-1} + \Delta\theta$; $k$ (=2) is the symbol index and $\Delta\theta$ is the differential angle of
the transmitted dibit. $\Delta\theta$ changes the phase shift of the carrier.
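
The crosstalk predicted by (6.55) and (6.56) can be seen by evaluating the two expressions directly. The sketch below is a minimal illustration; the function name and the example mismatch values are assumptions made for the demonstration.

```python
import math


def received_symbol(i_tx, q_tx, alpha_i, alpha_q, theta_i, theta_q):
    """Baseband received symbol with gain/phase mismatch, per (6.55) and (6.56)."""
    i_rx = (1 + alpha_i) * (i_tx * math.cos(theta_i) + q_tx * math.sin(theta_i))
    q_rx = (1 + alpha_q) * (-i_tx * math.sin(theta_q) + q_tx * math.cos(theta_q))
    return i_rx, q_rx


# No mismatch: the symbol passes through unchanged.
ideal = received_symbol(1.0, 0.0, 0.0, 0.0, 0.0, 0.0)

# Hypothetical 0.05 rad phase error in the I path: a pure-Q symbol now
# produces a nonzero I component (crosstalk).
leaky = received_symbol(0.0, 1.0, 0.0, 0.0, 0.05, 0.0)
print(ideal, leaky)
```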


6.8.2     Techniques  to  Combat  Problems  Due  to  Mismatch 

Techniques have been proposed to combat problem (1), image leakage, and
problem (2), crosstalk, either together or individually.

In [17] a DEM-based approach has been proposed to combat mainly problem (1).
Hardware has been built to operate the scheme under practical impairments.
It works by swapping the two paths, depending on the relative sign of the outputs of the I and Q
sigma-delta modulators. Specifically, the two 1-bit outputs of the sigma-delta
modulators are compared. Depending on whether they are the same or not, swapping is
performed. With a 1-bit output, comparison is easily performed using an EXOR
gate. Swapping is done on a per-bit basis (bit timing). The DEM scheme is simple
in implementation but is applicable only to an IF digitizer using a sigma-delta modulator with
a 1-bit quantizer.

Compensation-based approaches have been proposed to combat problems (1) and
(2) in the IF digitizer part of the receiver as well as in the entire receiver
([18]-[25]). These typically involve estimating the gain and phase error with a
calibration loop. The circuitry in the receiver is made programmable in gain and
phase, and is adjusted accordingly. This is an adaptive process, and some
kind of adaptive signal processing, such as the least mean square (LMS) method, is adopted
so that the gain and phase are adjusted to compensate for the mismatch. Because it
is adaptive in nature, compensation is not instantaneous; a certain convergence
time is required. In general, these schemes are complex and involve up to eight full
multiplications per sample. In [18] only simulations (but no hardware) were used to
test the operation of the scheme for QPSK, GMSK and 16-QAM modulation. [18]
measures imbalances using a self-generated test tone and corrects them at start-up.
It requires the generation of a signal with a frequency of four times the local
oscillator frequency and possibly an extra filter to suppress spurious tones. [19]-[21]
use LMS-based adaptive filters to correct for imbalances for 16-QAM and
QPSK modulation. [19] uses an LMS-based adaptive filter to compensate the
mismatch in the IF digitizer part of the receiver. In [20] and [21] simulations and
hardware for compensation were used to test the operation of the scheme. [20] and
[19] are similar, but [20] compensates the mismatch in the receiver (of which the IF
digitizer is a part) and the implementation for compensation is complicated,
involving a 32-bit floating point signal processor operating at 33 MHz. [21] uses an
analog-digital adaptive image-reject technique to compensate the mismatches. It
compensates the mismatch in the receiver and the implementation for compensation
is complicated, involving a D/A converter and a digital multiplier. However, neither
[20] nor [21] needs a training signal, and on-line correction is possible. [22] reports
that its own calibration has some restriction. The calibration in [23] needs an external image
tone. [24] needs a training signal. [25] focuses on the IF digitizer part, and uses only the
sign bit in the compensation algorithm, which can potentially be unstable. This is
summarized in Table 6.4.

Finally, a dynamic quadrant swapping approach [26, 27] has been proposed to
combat mainly problem (2). Like the swapping scheme reported in [17], this
also performs swapping on a dynamic basis. However, unlike [17], it swaps the
received vector, rather than the 1-bit output from the sigma-delta modulator. Thus it
swaps on a per-symbol, rather than per-bit, basis. Secondly, since the whole symbol is
swapped, no assumption is made about the ADC architecture. Therefore it can be
applied to ADCs other than sigma-delta modulators, for example, the pipelined ADC
reported in [8]. Here the receiver architecture is assumed to be such that, for the IF under
consideration, the preceding channel filter suppresses the image sufficiently so that
problem (1) is no longer the main problem, although the technique also corrects for
some of the image leakage associated with problem (1). [26, 27] propose a
scheme that is simple in implementation. For ease of explanation, let us follow
Section 6.8.1 and use DQPSK as an example. The flowchart in Figure 6.29 explains the
algorithm part. The notations are explained in Table 6.3.

Let us start by considering the case when dibit = 01. With dibit = 01, $(I_1, Q_1)$ lies in
the first quadrant and $(I_2, Q_2)$ lies in the second quadrant. Applying this information
to (6.55) and (6.56), we have the following equations for the two received I(t), Q(t):

$$\hat{I}_1 = (1+\alpha_i)\,[\cos\theta_i + \sin\theta_i] \qquad (6.58)$$

$$\hat{Q}_1 = (1+\alpha_q)\,[\cos\theta_q - \sin\theta_q] \qquad (6.59)$$

$$\hat{I}_2 = (1+\alpha_i)\,[-\cos\theta_i + \sin\theta_i] \qquad (6.60)$$

$$\hat{Q}_2 = (1+\alpha_q)\,[\cos\theta_q + \sin\theta_q] \qquad (6.61)$$

When $(\hat{I}_1, \hat{Q}_1)$ and $(I_{1\_2nd\_IF}, Q_{1\_2nd\_IF})$ are applied to the flowchart, they
correspond to the $(\hat{I}, \hat{Q})$ and $(I_{2nd\_IF}, Q_{2nd\_IF})$ inputs to the quadrant
detection box. The quadrant detection box takes $(\hat{I}_1, \hat{Q}_1)$, as represented by
(6.58), (6.59), and detects the quadrant to be one. The quadrant detection box
then states that we do not perform processing and just pass $(\hat{I}_1, \hat{Q}_1)$ as is.
Therefore in this case $(\hat{I}_1, \hat{Q}_1)$ corresponds to the $(\hat{I}, \hat{Q})$ input to the $\Delta\theta$ detection
box in Figure 6.29. Next, let us consider received symbol 2. When $(\hat{I}_2, \hat{Q}_2)$ and
$(I_{2\_2nd\_IF}, Q_{2\_2nd\_IF})$ are applied to the flowchart in Figure 6.29, they now
correspond to the $(\hat{I}, \hat{Q})$ and $(I_{2nd\_IF}, Q_{2nd\_IF})$ inputs to the quadrant detection box.
The quadrant detection box takes $(\hat{I}_2, \hat{Q}_2)$, as represented by (6.60), (6.61),
and detects the quadrant to be two. The quadrant detection box
then states that we do perform processing. This is done by passing
$(I_{2\_2nd\_IF}, Q_{2\_2nd\_IF})$ to the dotted box, to be processed. We henceforth denote the received
symbol that comes out of the dotted box as the processed symbol,
or $(\hat{I}_{process}, \hat{Q}_{process})$. $(\hat{I}_{process}, \hat{Q}_{process})$ is at baseband. In the present
case $(\hat{I}_{process}, \hat{Q}_{process})$ corresponds to $(\hat{I}_{2process}, \hat{Q}_{2process})$. $\hat{V}_{2process}$ is the vector
formed by $(\hat{I}_{2process}, \hat{Q}_{2process})$. Thus the input to the $\Delta\theta$ detection box is
$(\hat{I}_{2process}, \hat{Q}_{2process})$, or $\hat{V}_{2process}$. Examining the dotted box, it is seen that
processing consists of transforming followed by swapping. In the present case
$(I_{2nd\_IF}, Q_{2nd\_IF})$ corresponds to $(I_{2\_2nd\_IF}, Q_{2\_2nd\_IF})$ and $(\hat{I}_{transform}, \hat{Q}_{transform})$
corresponds to $(\hat{I}_{2transform}, \hat{Q}_{2transform})$. Let us consider for the time being
$I_{2\_2nd\_IF}$. Transformation is done by first shifting $I_{2\_2nd\_IF}$ by 90° (i.e. 1/4 of the
period of the second IF signal, denoted as $T_{2nd\_IF}$). We then mix the shifted signal
with a 2nd LO, at frequency $\omega_{c2}$ (since the signal is already in the digital domain,
mixing is done digitally), and then collect baseband terms. The result is denoted
as $\hat{I}_{2transform}$. Physically the 90° shifting can be explained as follows. We first
write $I_{2\_2nd\_IF}$ explicitly as $I_{2\_2nd\_IF}(n)$, where $n$, the sampling index, highlights
that $I_{2\_2nd\_IF}$ is a sampled (and digitized) signal. Its sampling frequency is denoted
as $f_s$, i.e. its sampling period is $T_s$. 90° shifting means shifting $n$ by $(1/4)\times(T_{2nd\_IF}/T_s)$,
and the result is $I_{2\_2nd\_IF}\!\left(n - \tfrac{1}{4}\tfrac{T_{2nd\_IF}}{T_s}\right)$. Next we consider $Q_{2\_2nd\_IF}$. The above
is repeated, except we shift by -90°. The result is denoted as $\hat{Q}_{2transform}$.
To see the effect of the 90° shifting, let us consider its impact on the I and Q path
signals, starting from the 1st IF signal. In the present case the 1st IF signal
corresponds to $S_{2QPSK}$. It is more convenient to represent the signal along the IF
digitizer in the continuous time domain, i.e. keep treating the A/D converter as a gain
block with gain one. Thus, without shifting, the I-path signal can mathematically be
written as $(1+\alpha_i)\,[S_{2QPSK}\cos(\omega_c t + \theta_i)]$ and the Q-path signal as
$(1+\alpha_q)\,[S_{2QPSK}\sin(\omega_c t + \theta_q)]$. With 90° shifting they are modified to
$(1+\alpha_i)\,[S_{2QPSK}\cos(\omega_c t + \theta_i - 90°)]$ and $(1+\alpha_q)\,[S_{2QPSK}\sin(\omega_c t + \theta_q - 90°)]$, respectively.

Fig. 6.29 Flowchart of algorithm
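
The statement that a 90° shift is a shift of the sampling index by $(1/4)(T_{2nd\_IF}/T_s)$ can be checked on a sampled tone. The sketch below is only an illustration: the IF and sampling rates are hypothetical values chosen so that the shift is a whole number of samples, and the function names are not from the text.

```python
import math


def quarter_period_shift(f_if_hz, f_s_hz):
    """Samples to shift for a 90 degree delay: (1/4) * (T_2nd_IF / T_s)."""
    return 0.25 * f_s_hz / f_if_hz


# Hypothetical rates: sampling at 8x the second IF gives a 2-sample shift.
f_if_hz = 1.0e6
f_s_hz = 8.0e6
shift = int(quarter_period_shift(f_if_hz, f_s_hz))


def tone(n):
    """Sampled cosine at the second IF."""
    return math.cos(2 * math.pi * f_if_hz * n / f_s_hz)


# Delaying the cosine by a quarter period should reproduce a sine.
delayed = [tone(n - shift) for n in range(16)]
expected = [math.sin(2 * math.pi * f_if_hz * n / f_s_hz) for n in range(16)]
max_error = max(abs(d - e) for d, e in zip(delayed, expected))
print(shift, max_error)
```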

To derive $\hat{I}_{2transform}$ and $\hat{Q}_{2transform}$ mathematically, we start from (6.54). With
$I(t) = I_2 = -1$ and $Q(t) = Q_2 = 1$, we have $S_{2QPSK}(t) = (-\cos\omega_c t) - \sin\omega_c t$.
We substitute this for $S_{2QPSK}$ in the above modified mathematical representations.
We then mix and collect only the baseband terms, and we obtain:

$$\hat{I}_{2transform} = -(1+\alpha_i)\,[\cos\theta_i + \sin\theta_i] \qquad (6.62)$$

$$\hat{Q}_{2transform} = (1+\alpha_q)\,[\cos\theta_q - \sin\theta_q] \qquad (6.63)$$

Next we take the negative of each of $\hat{I}_{2transform}$ and $\hat{Q}_{2transform}$ and swap them:

$$\hat{I}_{2process} = -\hat{Q}_{2transform} = -(1+\alpha_q)\,[\cos\theta_q - \sin\theta_q] \qquad (6.64)$$

$$\hat{Q}_{2process} = -\hat{I}_{2transform} = (1+\alpha_i)\,[\cos\theta_i + \sin\theta_i] \qquad (6.65)$$

Remember that in this case $(\hat{I}_{2process}, \hat{Q}_{2process})$ corresponds to the $(\hat{I}_{process}, \hat{Q}_{process})$
input to the $\Delta\theta$ detection box in Figure 6.29. With $(\hat{I}_1, \hat{Q}_1)$ and $(\hat{I}_{2process}, \hat{Q}_{2process})$
applied to the $\Delta\theta$ detection box, $\Delta\hat{\theta}$, at the output of this box, has its cosine given by:

$$\cos(\Delta\hat{\theta}) = \frac{\hat{I}_1\,\hat{I}_{2process} + \hat{Q}_1\,\hat{Q}_{2process}}{|\hat{V}_1|\,|\hat{V}_{2process}|} \qquad (6.66)$$

Substituting (6.58), (6.59), (6.64), (6.65) in (6.66), the numerator of (6.66)
becomes:

$$\begin{aligned}
\text{numerator} &= -(1+\alpha_i)(1+\alpha_q)(\cos\theta_i + \sin\theta_i)(\cos\theta_q - \sin\theta_q) \\
&\quad + (1+\alpha_i)(1+\alpha_q)(\cos\theta_i + \sin\theta_i)(\cos\theta_q - \sin\theta_q) \\
&= 0
\end{aligned} \qquad (6.67)$$

Thus $\cos(\Delta\hat{\theta})$ is identically zero for arbitrary $\theta_i$ and $\theta_q$. A $\Delta\hat{\theta}$ of 90°, the correct
value, is detected for all mismatch values.

The  application  of  the  algorithm  to  the  case  for  other  dibits  can  be  explained 
similarly. 
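
The cancellation in (6.67) can also be checked numerically. The sketch below evaluates (6.58), (6.59) and (6.62) to (6.65) for one set of mismatch values and compares the detected cosine with and without processing. The function names and the mismatch values are assumptions made for illustration only.

```python
import math


def received(i_tx, q_tx, a_i, a_q, th_i, th_q):
    """Received symbol with mismatch, per (6.55) and (6.56)."""
    i_rx = (1 + a_i) * (i_tx * math.cos(th_i) + q_tx * math.sin(th_i))
    q_rx = (1 + a_q) * (-i_tx * math.sin(th_q) + q_tx * math.cos(th_q))
    return i_rx, q_rx


def processed_symbol_2(a_i, a_q, th_i, th_q):
    """Second-quadrant symbol after transform (6.62), (6.63) and swap (6.64), (6.65)."""
    i_tr = -(1 + a_i) * (math.cos(th_i) + math.sin(th_i))
    q_tr = (1 + a_q) * (math.cos(th_q) - math.sin(th_q))
    return -q_tr, -i_tr


def cos_angle(v1, v2):
    """Cosine of the angle between two symbol vectors, as in (6.66)."""
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    return dot / (math.hypot(*v1) * math.hypot(*v2))


mismatch = dict(a_i=0.05, a_q=-0.02, th_i=0.10, th_q=-0.07)   # hypothetical values
v1 = received(1.0, 1.0, **mismatch)         # symbol 1, first quadrant
v2_raw = received(-1.0, 1.0, **mismatch)    # symbol 2 without processing
v2_proc = processed_symbol_2(**mismatch)    # symbol 2 with processing
print("without processing:", cos_angle(v1, v2_raw))
print("with processing:   ", cos_angle(v1, v2_proc))
```

Without processing the cosine is nonzero, so the detected angle deviates from 90°; with processing it vanishes regardless of the mismatch values chosen.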


6.9     Appendix: Low-Voltage Low-Pass Modulator

So far in this chapter we have achieved low power dissipation in a sigma-delta
modulator via the use of a passive loop filter. Another approach is to minimize
the power supply voltage $V_{dd}$. Various low-voltage designs have been proposed
[28, 29]. Among them, the one discussed in [29] achieves low-voltage ($V_{dd}$ = 1.95 V)
operation by using a new architecture called a local feedback loop. Here a high-order
modulator is stabilized by a local feedback loop around each integrator.
Unlike the multistage architecture, this architecture is very tolerant of the
modest gain of low-voltage op-amps. We spend the rest of this appendix
discussing this approach.

In stabilizing typical high-order, single-stage, single-bit modulators by decreasing
integrator gains, an undesirable effect is that their dynamic range is reduced.
The stable multistage (MASH) modulator with first- or second-order individual
stages requires high-gain op-amps to achieve the necessary quantization noise
cancellation. High-gain cascode op-amps are not attractive choices at low supply
voltage because of the further loss in output swing. Although high-gain op-amps at
low supply voltage can be realized with cascaded stages, they tend to consume more
power due to the larger number of current branches. The new architecture
can tolerate low-gain op-amps, which are readily realized at low supply voltage, and
hence achieves low power dissipation.

Fig. A.1 Block diagram of low-voltage sigma-delta modulator

Figure A.1 shows a third-order example of this low-voltage modulator scheme
[4]. Apart from the three local feedback loops (each consisting of an overload
detector [OVL], a local tri-level DAC, and a scale factor [$l_1$, $l_2$, or $l_3$]) and the three
digital compensators $H_1$, $H_2$, $H_3$, the basic circuit is a third-order single-stage
modulator (i.e. the same as in Figure 6.13, with the same stability problem).
When the integrators operate within their normal range $(-V_{ovl}, V_{ovl})$, the OVLs
output 0 (Figure A.1) and therefore neither the local feedback loops nor the digital
compensators are activated. In essence, the modulator operates just like an
ordinary third-order single-stage modulator that is very tolerant of modest op-amp
gain and component mismatch but is prone to instability, particularly at high
input levels. If an exceedingly large integrator level is detected by any of the
OVLs, the corresponding integrator is forced back to the normal operating range
$(-V_{ovl}, V_{ovl})$ via the tri-level DAC. Note that by activating this local feedback loop
we have injected an extra signal into the basic third-order loop, thereby degrading
the SNR. Therefore, an associated digital compensator ($H_1$, $H_2$, $H_3$) is introduced
and designed such that its transfer function matches the transfer function
of the path from this tri-level DAC to $y(n)$. By doing so, the extra
signal injected into the basic third-order loop is canceled digitally at the final output
and the original SNR of the basic modulator is restored.

Table A.1 Performance of local feedback low voltage modulator

| Bandwidth | Resolution | Power | Distortion (HD3) | $V_{dd}$ | Technology |
| 8 kHz | 12 bit | 0.34 mW | -95 dB @ -5 dBm | 1.95 V | 1.2 µm CMOS |
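
The idea of pulling an integrator back with a tri-level DAC and then cancelling the injected signal digitally can be illustrated with a deliberately simplified model. The sketch below is a toy, open-loop, first-order analogue and not the third-order modulator of Figure A.1: the single integrator, the DAC step equal to $V_{ovl}$, the accumulator used as the "digital compensator", and all names are assumptions.

```python
def overload_detect(v, v_ovl):
    """Tri-level overload detector: 0 inside (-v_ovl, v_ovl), otherwise +/-1."""
    if v > v_ovl:
        return 1
    if v < -v_ovl:
        return -1
    return 0


def integrate_with_local_feedback(inputs, v_ovl=1.0):
    """Toy integrator whose state is pulled back by a tri-level DAC on overload."""
    state = 0.0
    states, flags = [], []
    for u in inputs:
        state += u
        d = overload_detect(state, v_ovl)
        state -= d * v_ovl            # local feedback: subtract one DAC step
        states.append(state)
        flags.append(d)
    return states, flags


def digitally_compensate(states, flags, v_ovl=1.0):
    """Add back what the DAC removed, using a digital copy of the same path."""
    restored, removed = [], 0.0
    for s, d in zip(states, flags):
        removed += d * v_ovl
        restored.append(s + removed)
    return restored


inputs = [0.3] * 20                   # hypothetical constant input
states, flags = integrate_with_local_feedback(inputs)
restored = digitally_compensate(states, flags)
```

In this toy model the integrator state stays within the normal range, while the compensated sequence equals what an ideal, unbounded integrator would have produced, which mirrors the role the digital compensators play at the final output.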

A  tri-level  DAC  can  be  realized  with  good  linearity  in  a  differential  architecture. 
The  digital  compensators  can  also  be  approximated  in  the  baseband  by  the 
simple  FIR  filters: 

//1(z)=^z-3  (A.1) 

./i 

f/2(z)^z-2(i-r')  (A.2) 

H,{z)=^-z-\l-z-Y  (A3) 

The accuracy of this digital compensation does depend on the matching between
the analog modulator and the digital compensators, as in a MASH architecture.
However, since in reality the OVLs are invoked relatively infrequently, the requirement
on the matching is relaxed. Therefore, simple op-amps (without cascode
devices or cascaded stages) having only modest DC gain, but readily realized
even at low supply voltage, can be used as in ordinary single-stage modulators
without the inherent stability problem.

In general, the modulator $k$, $j$, and $l$ coefficients have to satisfy certain
relationships that are sufficient (but not necessary) conditions for stability. Practically,
however, some of these criteria can be relaxed to accommodate easier
implementation and better SNR performance without jeopardizing the modulator
stability. One of the key issues is to select the coefficients such that most of the
overloading at high input levels occurs at OVL2 and OVL3 rather than at
OVL1. Such a modulator has been implemented, with measured performance [4] as
summarized in Table A.1.

Fig. P.6.1 Modified second order sigma-delta modulator


6.10    Problems 

6.1  In this problem we investigate the time domain response of a sigma-delta
modulator. In the following cases, assume that the input $V_{in}$ consists of a
ramp that goes from 0 to $V_{FS}$ in 50 clock cycles (assume $V_{FS}$ = 1 V). Plot
$V_{out}$ for these 50 clock cycles in the following cases:

(a)  A  first-order  sigma-delta  modulator  with  a  2-bit  internal  quantizer 

(b)  A  second-order  sigma-delta  modulator  with  a  1-bit  internal  quantizer 

(c)  A  second-order  sigma-delta  modulator  with  a  2-bit  internal  quantizer 

6.2  Find the signal transfer function (STF) and noise transfer function (NTF) of a
modified second-order sigma-delta modulator. This modulator has a loop filter
in the forward and feedback path (shown in Figure P.6.1). Note that $X(z)$ is the
input and $Y(z)$ is the output.

6.3  Show that the total quantization noise power in the band of interest $(-f_N/2, f_N/2)$
of an $L$th-order low-pass sigma-delta modulator, assuming a white noise
model, is given by

$$e_{rms}^2\,\frac{\pi^{2L}}{2L+1}\left(\frac{1}{\mathrm{OSR}}\right)^{2L+1}$$

where $L$ is the order of the sigma-delta modulator. Here $f_N$ is the Nyquist
frequency, $e_{rms}^2$ is the quantization noise power of the quantizer inside
the modulator, and the oversampling ratio (OSR) $= f_s/f_N$ is assumed to be
much larger than 1.
Fig. P.6.2 Multi-stage sigma-delta A/D converter

6.4  The block diagram of a cascaded modulator, in which a second-order sigma-delta
modulator is cascaded with a first-order modulator, is shown in Figure
P.6.2. The blocks labeled $z^{-1}$ are delay blocks. Assume that the quantization
noise of the internal ADCs can be modeled as additive white noise and that the


integrators have a transfer function of $1/(1 - z^{-1})$. Assume the D/A is modelled with
gain 1 (and no delay), just as in the chapter. Assume the NTF and STF for the
individual modulators are the same as in the chapter (i.e. $k_{1a} k_{1b}\,\mathrm{gain}_{A/D} = 1$).

(a)  Derive  an  expression  for  the  output  Y(z)  in  terms  of  the  input  X(z)  and  the 
quantization  noise  of  the  two  ADCs,  E\{z)  and  Ei(z).  Simplify  the  final 
result  for  the  case  when  g\\\k\.Ji\\,  —  1  (perfect  matching). 

(b)  What  is  the  increase  in  the  quantization  noise  of  the  output  due  to  the 
presence  of  a  mismatch  characterized  by  the  factor  8=1  -  gi/i&ia&ib? 
Express  your  answer  in  terms  of  the  following  parameters: 
$\sigma_\delta$, $g_1$, OSR, $\sigma_{q1}^2$, $\sigma_{q2}^2$, where $\sigma_\delta$ is the standard deviation of $\delta$ and $\sigma_{q1}^2$,
$\sigma_{q2}^2$ are the quantization noise powers of the two internal quantizers,
respectively.

6.5  For  the  architecture  shown  in  Figure  P.6.2,  evaluate  an  expression  for  the 
increase  in  quantization  noise  due  to  the  finite  gain  of  the  first  integrator's 
op-amp.  The  transfer  function  of  the  integrator  with  finite  op-amp  gain  is 


h(z) 


(1 


l)(l  +  H)  +  <z[i 


where $\mu = 1/A$, $A$ is the gain of the op-amp, and $\alpha$ is the ratio of the sampling
capacitor to the integrating capacitor.

6.6  For the third-order interpolative sigma-delta modulator shown in Figure P.6.3,
the transfer function of the integrators is $(z - 1)^{-1}$. Derive the STF and
NTF.

Fig. P.6.3 Third-order interpolative sigma-delta modulator


6.7  Repeat the design of the low-pass sigma-delta modulator given in Design
Example 6.1, except this time we have the following specs: $f_{IF}$ = 20 MHz,
$f_{BW}$ = 4 kHz, DR = 78 dB, power supply = 5 V, $V_{FS}$ = 2 V.

6.8  In  Figure  6.27  of  Design  Example  6.2  we  have  indicated  a  block-level  diagram 
implementation  of  a  resonator  in  a  BP  modulator.  The  resonator  has  transfer 
function  Gz~  1(1— z~  ),  where  G  is  the  gain  of  the  resonator,  using  two 
integrators  whose  transfer  functions  both  equal  z_1/(l— z~  )■ 

(a)  Using  integrators  whose  transfer  function  are  z  /(l— z_1),  show  a  block- 
level  diagram  implementation  of  the  same  resonator. 

(b)  Modify  the  integrator  as  shown  in  Figure  6. 14  so  that  it  has  the  new  transfer 
function  z  /(l— z_1)  and  show  the  new  circuit  schematic.  Show  clearly 
the  clock  phasing  in  the  circuit  schematic. 

(c)  Translate  the  block  level  diagram  obtained  in  (a)  into  a  circuit  schematic. 
The  circuit  should  use  the  integrator  obtained  in  (b).  Show  clearly  the  clock 
phasing  in  the  circuit  schematic. 

6.9  Repeat the design of the BP sigma-delta modulator as given in Design Example
6.2, except this time use the following specs: $f_{IF}$ = 20 MHz, $f_{BW}$ = 4 kHz, DR =
78 dB, power supply = 5 V, $V_{FS}$ = 2 V, tolerable reduction in DR = 5 dB. In
Section 6.7.2.1 we stated that for $L = 4$, in order to achieve a DR reduction of
less than 5 dB, the fractional change in $f_0$ ($\Delta f_0/f_0$) must be less than 0.2%. In
the present problem (which may involve a different $L$), what is the tolerable
fractional change in $f_0$? (Hint: Instead of doing simulation, you may find this by
comparing the NTFs of the two BP modulators.) This new fractional change
should be used in the problem for proper calculation.

Chapter 7

Frequency  Synthesizer:  Phase/Frequency 

Processing  Components 


7.1     Introduction 

As  discussed  in  Chapter  2,  a  receiver  front  end  consists  of  two  blocks:  a  signal 
conditioning  block  and  a  signal  controlling  block.  We  have  discussed  the 
components  needed  for  the  signal  conditioning  block  in  the  last  few  chapters: 
how  the  LNA  amplifies  the  small  RF  input  signals,  the  mixer  mixes  down  the 
high-frequency  RF  input  signal,  and  the  A/D  converter  digitizes  the  mixed-down 
signal.  In  addition,  we  have  assumed  that  someone  is  going  to  generate  the  LO 
(local oscillator) signal for the mixer. It turns out that the requirement on this signal
is not trivial, as discussed in Chapters 4 and 5. This is because any impairment
of this LO signal's integrity, such as noise or finite fall time of the clock edge,
will adversely affect the integrity of the mixed-down signal. Accordingly, a
well-controlled oscillator is required to meet this requirement. Henceforth this
subcomponent of the receiver front end is denoted as the frequency synthesizer.
In  addition,  since  there  are  multiple  users  in  the  system,  we  need  a  way  of  allowing 
each  user  access  to  the  same  shared  transmission  medium  (air  in  this  case).  This 
multiple  access  control  can  be  implemented  through  frequency,  time,  or  code 
division  multiplexing.  In  the  case  of  frequency  division  multiplexing,  the  frequency 
synthesizer  also  carries  out  the  access  control  function. 

Traditionally there are two major ways of generating frequencies: direct and
indirect approaches. The direct approach consists of generating a sine wave digitally
(by storing the sine wave in a read-only memory (ROM), for example) followed by
digital-to-analog (D/A) conversion. Even though this approach enjoys the advantage
of being mostly digital and is therefore easy to design, manufacture, and test, it needs
a high-speed D/A converter, which can be the major bottleneck. In addition, it tends
to consume high power and is therefore not considered further in this book. Instead
we look at the indirect way of generating frequencies: the use of a controlled
oscillator. The general idea is to control the frequency of oscillation via some
external control input (usually voltage), and hence the resulting oscillator is called
a voltage controlled oscillator (VCO). Since the noise property and frequency
stability of such a VCO are usually rather poor, it is typically put in a feedback loop to
enhance these properties. Since the loop tracks the phase (and hence frequency), it is
called a phase locked loop (PLL) and so the approach is denoted as a PLL-based
frequency synthesizer. This approach is the main focus in this book.
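
For comparison with the PLL-based approach, the direct approach mentioned above (a sine wave stored in a ROM and read out digitally) can be sketched in a few lines. The phase-accumulator structure, the table size, and all names below are assumptions chosen for illustration; the text itself only states that the sine wave is stored in a ROM and followed by D/A conversion.

```python
import math


def make_sine_rom(addr_bits):
    """One full sine period stored in a lookup table (stands in for the ROM)."""
    size = 1 << addr_bits
    return [math.sin(2 * math.pi * k / size) for k in range(size)]


def direct_synth(fcw, acc_bits, rom, n_samples):
    """Step a phase accumulator by fcw each clock and look up the sine value."""
    addr_bits = len(rom).bit_length() - 1
    mask = (1 << acc_bits) - 1
    phase = 0
    samples = []
    for _ in range(n_samples):
        # The top addr_bits of the accumulator address the ROM.
        samples.append(rom[phase >> (acc_bits - addr_bits)])
        phase = (phase + fcw) & mask
    return samples


rom = make_sine_rom(addr_bits=8)
# In this sketch the output frequency is fcw * f_clk / 2**acc_bits,
# so fcw = 4096 with a 16-bit accumulator gives one period every 16 clocks.
samples = direct_synth(fcw=4096, acc_bits=16, rom=rom, n_samples=32)
```

The digital samples would then drive the D/A converter, which must run at the full clock rate; this is the bottleneck referred to above.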

For low system cost and portability, frequency synthesizers need to be integrated
with an on-chip VCO. The difficulty with an integrated VCO lies in its poor frequency
stability compared with an external high-Q resonator-based VCO. The phase noise of
the integrated VCO also contributes directly to reducing the signal-to-noise ratio
(SNR) of the receiver front end. The phase noise problem (phase noise is defined more
precisely later) can be alleviated partially by broadbanding the PLL. Essentially,
the feedback action allows the PLL to suppress the inband VCO phase noise
by high-pass filtering it.

This  chapter  begins  with  a  discussion  of  a  typical  integrated  frequency 
synthesizer,  followed  by  some  of  its  subcomponents:  namely,  the  phase  detector 
(PD),  divider,  and  VCO.  These  subcomponents  are  new  in  this  book  in  the  sense 
that  they  allow  us  to  process  the  frequency/phase  as  opposed  to  the  amplitude 
(either  voltage/current)  of  a  signal.  We  pay  special  attention  to  familiarizing 
readers  with  the  components  possessing  this  property.  The  nonidealities  of  these 
components  will  have  an  impact  on  the  synthesizer.  Again,  we  focus  on  these 
nonidealities'  impact  on  the  frequency/phase  properties  of  the  output  signals  from 
these  subcomponents  (e.g.,  spur  from  PD,  phase  noise  from  VCO).  The  loop  filter, 
which  can  be  looked  at  as  the  controller  of  the  synthesizer,  will  be  discussed  in 
Chapter  8  as  its  role  is  to  coordinate  these  other  subcomponents  to  achieve  the  goal 
of  frequency  synthesizing  with  the  least  impairment. 

7.2     PLL-Based  Frequency  Synthesizer 

Most books on PLLs [13,14] focus on aspects that cover the use of the PLL in general
applications, such as data acquisition and carrier recovery. This chapter
concentrates on and highlights those aspects of the PLL that are unique to the frequency
synthesizer application. For example, the phase frequency detector (PFD),
although not suitable for bit extraction, is universally used in the frequency synthesizer,
and is emphasized here, as is the use of a charge pump. Again, aspects that are
unique to frequency synthesizer applications, but less relevant to other PLL-based
applications, such as the divider, are given extensive treatment. Even in the discussion
of subcomponents usually found in a PLL book, such as the VCO, the unique environment
facing its design in the present application (namely, the large tuning range) is
given special attention.

A typical PLL-based frequency synthesizer is shown in Figure 7.1 and contains a
reference source oscillating at frequency $f_r$ and a VCO oscillating at
frequency $f_o$. The reference frequency is divided by an integer N and the VCO
frequency is divided by M; the two divided waves are then compared in a phase
detector. When the two phases are equal (phase locking), $f_r/N = f_o/M$. This
also means that the output frequency is locked to a rational fraction of the
reference frequency, $f_o = (M/N) f_r$. In essence, the synthesizer is capable
of generating a large number of highly accurate output frequencies. Frequency
selection is achieved by changing the divider ratios M and N.

Fig. 7.1 PLL-based frequency synthesizer

Fig. 7.2a Block diagram of a PLL
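As a minimal sketch of how the divider ratios select the output frequency, the
lock condition $f_r/N = f_o/M$ can be evaluated directly. The function name and
the numerical values below (a 13 MHz reference, N = 65) are hypothetical and are
chosen only for illustration; they are not taken from any particular standard.

```python
from fractions import Fraction

def synthesizer_output_hz(f_ref_hz, n_ref_div, m_vco_div):
    """Locked output frequency from the lock condition f_r / N = f_o / M."""
    if n_ref_div <= 0 or m_vco_div <= 0:
        raise ValueError("divider ratios must be positive integers")
    # Exact rational arithmetic avoids rounding when comparing channel spacings.
    return Fraction(f_ref_hz) * m_vco_div / n_ref_div

# Hypothetical example: 13 MHz reference divided by N = 65 (200 kHz comparison
# frequency); stepping M by one moves the output by one comparison frequency.
f_ref = 13_000_000
channels = [synthesizer_output_hz(f_ref, 65, m) for m in (4500, 4501, 4502)]
step = channels[1] - channels[0]
print([int(f) for f in channels], int(step))
```

With N fixed, the channel spacing equals $f_r/N$, which is why the choice of
comparison frequency is tied to the channel raster of the system.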

Since  the  frequency  synthesizer  is  PLL  based,  a  background  discussion  of  the 
PLL  is  in  order. 

Figure  7.1  is  now  redrawn  in  Figure  7.2a  to  facilitate  the  explanation  of  the 
operating  principle  of  a  PLL.  Notice  that  the  dividers  are  omitted,  but  this  does  not 
have  any  major  impact  on  our  explanation. 

In general, a PLL synchronizes an output signal (usually generated by an
oscillator) with a reference signal in terms of its frequency and phase. In the
locked state, the phase error between the output and reference signals is very
small, as ensured by the negative feedback principle. When there is a change in
the input, the output, or the internal parameters of the synthesizer itself
(e.g., a change in the divider ratio, if there is a divider), a phase error is
introduced. This error is converted to a form that can control the oscillator's
output frequency and phase. Because of the negative feedback, this frequency and
phase change occurs in such a way as to reduce the error. As shown, the PLL
consists of the following subcomponents:

1. A phase detector (PD) and logic-to-charge converter

2. A loop filter (LF)

3. A voltage-controlled oscillator (VCO)

Fig. 7.2b Plot of VCO frequency versus tuning

The signals are as follows: reference signal $V_r$ with frequency $f_r$ and
phase $\theta_r$; output signal $V_o$ (which is also the VCO output signal) with
frequency $f_o$ and phase $\theta_o$; phase detector output signal
$q_{pd\_final}$ (a charge variable); phase error $\theta_e$; and loop filter
output signal $V_{lf}$. Next, let us look at the input/output relationships of
these subblocks. First let us look at the VCO, where

$$f_o = \frac{d\theta_o}{dt} = f_{vco\_center} + K_{vco} \times V_{lf} \tag{7.1}$$


Here $f_{vco\_center}$ is the center frequency of the VCO and $K_{vco}$ is the
VCO gain in units of deg/s/V. $f_o$ is plotted as a function of $V_{lf}$ in
Figure 7.2b. Second, the phase detector compares the phase of the output signal,
$\theta_o$, with the phase of the reference signal, $\theta_r$, and develops the
phase error $\theta_e$. $\theta_e$ turns on the logic-to-charge (current)
converter, which generates a current $i_{pd}$ and starts to deliver charge
$q_{pd}$ to the capacitor in the loop filter F(s). At the end of a reference
signal cycle, the charge delivered, denoted as $q_{pd\_final}$, is designed to be
linearly proportional to the phase error $\theta_e$, as shown in Figure 7.2c and
expressed as follows:

$$q_{pd\_final} = K'_{pd} \times (\theta_r - \theta_o) = K'_{pd} \times \theta_e \tag{7.2a}$$


Here $K'_{pd}$ is the phase detector gain in units of coulomb/deg. The charge
$q_{pd\_final}$, when integrated on the capacitor of the loop filter, generates a
voltage $V_{lf}$ whose value is the accumulated sum of the contributions from all
previous values of $\theta_e$. The instantaneous value of $V_{lf}$ is therefore
proportional not to the instantaneous value of $\theta_e$ but to its integrated
value. If the loop filter is a simple capacitor C, then

$$V_{lf}(n) = q_c(n)/C \tag{7.2b}$$


where n denotes the nth cycle of the reference clock. Here $V_{lf}(n)$ and
$q_c(n)$ are the voltage and charge on capacitor C at reference cycle n. Also,

$$q_c(n) = q_c(n-1) + q_{pd\_final}(n) \tag{7.3}$$

Fig. 7.2c Plot of charge delivered and average current in a phase detector
versus phase error


Alternatively, we can adopt another variable $\bar{i}_{pd}$, defined to be the
average of $i_{pd}$ over one reference clock cycle. This $\bar{i}_{pd}$ is also
designed to be linearly proportional to $\theta_e$, as shown in Figure 7.2c:

$$\bar{i}_{pd} = K_{pd}\,\theta_e \tag{7.4a}$$

$K_{pd}$ is the phase detector gain in units of A/deg. (Note the difference
between $K'_{pd}$ and $K_{pd}$.) Again this current flows into the loop filter
and creates a $V_{lf}$. Then we can write

$$V_{lf}(n) = \frac{i_{pd}}{C}\,\Delta T + V_{lf}(n-1) \tag{7.4b}$$

Here $\Delta T$ is the period during which the phase detector is on. Also, we have

$$V_{lf}(n) = \frac{\bar{i}_{pd}}{C}\,T + V_{lf}(n-1) \tag{7.4c}$$

Here T is the period of $V_r$ and equals $1/f_r$.
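The two descriptions of the capacitor loop filter, in terms of charge per cycle
[(7.2a), (7.2b), (7.3)] and in terms of average current [(7.4a), (7.4c)], should
give the same $V_{lf}(n)$. The sketch below checks this numerically under the
assumption that $K_{pd} = K'_{pd}/T$, which follows from defining the average
current as the charge per cycle divided by T. All function names and component
values are hypothetical, and the VCO gain is taken in hertz per volt purely for
readability.

```python
import math

def vco_frequency(v_lf, f_center, k_vco):
    """Eq. (7.1): f_o = f_vco_center + K_vco * V_lf."""
    return f_center + k_vco * v_lf

def step_charge_form(v_prev, theta_e, k_pd_prime, c):
    """Eqs. (7.2a), (7.2b), (7.3): add q_pd_final = K'_pd * theta_e onto C."""
    q_final = k_pd_prime * theta_e
    return v_prev + q_final / c

def step_current_form(v_prev, theta_e, k_pd, c, t_ref):
    """Eqs. (7.4a), (7.4c): average current K_pd * theta_e flowing for T."""
    i_avg = k_pd * theta_e
    return v_prev + i_avg * t_ref / c

# Hypothetical values: 200 kHz reference, 1 nF capacitor, K'_pd = 1 pC/deg.
t_ref = 5e-6
c = 1e-9
k_pd_prime = 1e-12           # coulomb/deg
k_pd = k_pd_prime / t_ref    # A/deg (assumed relation between the two gains)

v_charge = v_current = 0.0
for theta_e in (10.0, 5.0, -2.5):   # phase error in degrees, one per cycle
    v_charge = step_charge_form(v_charge, theta_e, k_pd_prime, c)
    v_current = step_current_form(v_current, theta_e, k_pd, c, t_ref)

f_out = vco_frequency(v_charge, f_center=900e6, k_vco=1e6)
print(v_charge, v_current, f_out)
```

Note that $V_{lf}$ depends on the running sum of the phase errors, not on the
most recent one, which is the integrating behaviour described in the text.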


We are now in a position to explain qualitatively the operation of the PLL. Let
us assume that initially $f_r$ is equal to $f_{vco\_center}$ and the VCO also
oscillates at $f_{vco\_center}$. Therefore, $\theta_e$ is zero and
$q_{pd\_final}$ ($\bar{i}_{pd}$) is zero. Assuming initially that $V_{lf}$ is
zero, this means that $V_{lf}$ stays at zero and everything is in equilibrium.
If $\theta_e$ is nonzero initially, then $q_{pd\_final}$ and hence $V_{lf}$
become nonzero after a delay. $f_o$ then deviates, forcing $\theta_e$ to settle
back to zero. This is depicted in Figure 7.3 before time $t_0$. What happens
when there is a change of frequency? Let us assume that the frequency $f_r$ of
the reference signal $V_r$ is arbitrarily increased by $\Delta f$ at $t_0$. As
shown in Figure 7.3b, a higher frequency means a shorter period, and therefore
the phase of $V_r$ starts leading the phase of $V_o$. This nonzero phase error
$\theta_e$ increases with time. After some delay, the increasing $\theta_e$
leads to an increase in $q_{pd\_final}$ ($\bar{i}_{pd}$). This causes $V_{lf}$
to increase, resulting in a change to $f_o$, as described in Figure 7.3c and
Figure 7.3d. Due to negative feedback, this leads to a decrease of $\theta_e$.
Eventually the output frequency $f_o$ becomes the same as $f_r$. Notice that, in
contrast to the case of a change in the input phase, the final $\theta_e$ may
return to zero or settle at a finite value, depending on the order of the filter
F(s).

Fig. 7.3 Plot of internal voltage and frequency changes in a PLL when reference
frequency changes. (a) Reference voltage, (b) VCO voltage, (c) Loop filter
output voltage, (d) VCO frequency


7.3     Phase  Detector/Charge  Pump 

The  purpose  of  the  phase  detector  is  to  produce  a  signal  that  is  proportional  to  the 
difference  in  phase  between  two  signals.  There  are  two  main  categories  of  phase 
detectors:  analog  and  digital.  In  frequency  synthesizers,  almost  all  phase  detectors 
are  digital.  The  key  characteristics  are  as  follows: 

1. Gain (sometimes associated with the output or charge pump stage)

2.  Linearity 

3.  Steering  characteristics 

There  are  many  different  types.  Of  special  interest  are  the  following: 

1. Tristate phase/frequency detector (PFD)

2.  JK  flip-flop  based  PD 

3.  EXOR  gate  based  PD 

For  most  PLL-based  frequency  synthesizers,  the  PFD  is  the  preferred  choice. 
In  other  applications,  other  types  of  PD,  such  as  EXOR  and  JK  based  PD,  can  also 
be  used.  In  this  chapter  we  will  concentrate  on  PFD  but  we  will  briefly  compare 
it  with  another  PD. 


7.3.1     Phase  Frequency  Detector 

Let us first look at an example of a PFD [14]. Figure 7.4 shows a PFD based on
D flip-flops (D-FFs). (Other realizations also exist; a JK-FF based PFD is very
popular and potentially dissipates less power.) The PFD differs greatly from the
other types of phase detector. The major difference is the existence of a third
state, which will be shown to lead to a major advantage over other types of PD.
The PFD's output signal depends not only on the phase error $\theta_e$ but also
on the frequency error $\Delta f = f_r - f_o$ before locking is acquired. The
present PFD is built from two D-FFs, whose outputs are denoted Up and Down and
whose state variable is denoted $d_{pd}$. Since we have two storage elements,
$d_{pd}$ has four states:

1. Up = 0, Down = 0

2. Up = 1, Down = 0

3. Up = 0, Down = 1

4. Up = 1, Down = 1

Fig. 7.4 Phase frequency detector based on D-FF

Since we only need a tri-state device, one of the states is unused. We
arbitrarily select that to be the fourth state. However, we want to ensure that
if the PFD accidentally ends up in this unused state, it does not get stuck
there (a condition called lockout). This is guaranteed by ensuring that when the
fourth state is entered the PFD exits into one of the used states (arbitrarily
picked to be the first state).

In actual hardware, this is achieved as follows: whenever the fourth state is
detected, the AND gate in Figure 7.4 is activated and its output resets both
flip-flops back to the first state. Notice that this AND gate is also
instrumental in determining the state transitions. The state and $d_{pd}$ are now
defined as follows:


1. Up = 0, Down = 1, state = $d_{pd}$ = -1

2. Up = 0, Down = 0, state = $d_{pd}$ = 0

3. Up = 1, Down = 0, state = $d_{pd}$ = 1

4. Up = 1, Down = 1, state = $d_{pd}$ = inhibited

The actual state of the PFD is determined by the signals $V_r$ and $V_o$, the two
(positive-edge) trigger signals applied to the two D-FFs. Figure 7.5 is the
state diagram that describes the operation of the circuit depicted in
Figure 7.4. First let us see how the circuit in Figure 7.4 implements the state
diagram in Figure 7.5 by going through a state transition sequence. Assume from
Figure 7.5 that we are initially in the -1 state (i.e., Up = 0, Down = 1). Now
from Figure 7.4, a positive transition of $V_r$ triggers DFF1 [i.e., it gates
the D input (= 1) of DFF1 to the output Q, which is also the Up signal]. Since
D = 1, Q becomes 1; that is, Up changes from 0 to 1. Meanwhile, since $V_o$ does
not change, DFF2's output remains the same as before (i.e., Q = 1 and Down stays
at 1). In summary, the (Up, Down) pair moves from (0,1) to (1,1).

Fig. 7.5 State diagram of phase frequency detector

Notice that this (1,1) is the fourth, or inhibited, state. From Figure 7.4,
since both Up and Down = 1, the AND gate is enabled and the reset (R) signal is
high. Therefore, DFF1 and DFF2 are both reset and (Up, Down) goes to (0,0), so
$d_{pd}$ becomes 0, just as depicted in Figure 7.5.

Next let us look at how the transition from state 0 to 1 occurs in Figure 7.5.
When $V_r$'s edge rises again, Up (= Q of DFF1) goes from 0 (the reset state) to
1. Again, since $V_o$ does not change, Down (= Q of DFF2) stays at 0. Therefore
the state, or $d_{pd}$, goes from 0 to 1, as shown in Figure 7.5. From symmetry
the exact opposite happens when we start from the 1 state. Here repeated
activation of $V_o$ eventually takes the PFD to the -1 state. To summarize, a
positive edge of $V_r$ forces the PFD into its next higher state, unless it is
already in the 1 state. Similarly, a positive edge of $V_o$ forces the PFD into
its next lower state, unless it is already in the -1 state.
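The summary above amounts to a three-state saturating up/down machine, which can
be sketched behaviourally as follows. This is only a model of the state diagram,
not of the D-FF circuit itself: the transient passage through the inhibited
(1,1) state is collapsed into a single step, and the function names and the edge
labels "r" and "o" are hypothetical.

```python
def pfd_next_state(state, edge):
    """Advance the tri-state PFD on one rising edge.

    state: current d_pd value (-1, 0 or 1)
    edge:  "r" for a rising edge of V_r, "o" for a rising edge of V_o
    """
    if state not in (-1, 0, 1):
        raise ValueError("d_pd must be -1, 0 or 1")
    if edge == "r":
        return min(state + 1, 1)    # next higher state, saturating at 1
    if edge == "o":
        return max(state - 1, -1)   # next lower state, saturating at -1
    raise ValueError("edge must be 'r' or 'o'")

def pfd_trace(edges, state=0):
    """Return the sequence of d_pd values after each edge in `edges`."""
    trace = []
    for edge in edges:
        state = pfd_next_state(state, edge)
        trace.append(state)
    return trace

# V_o edge first, then V_r edge, repeated: d_pd pulses to -1 and returns to 0.
print(pfd_trace("oror"))
# Starting in -1, repeated V_r edges walk the state up and hold it at 1.
print(pfd_trace("rrr", state=-1))
```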

We now go through an example to see how this circuit actually detects phase.
Referring to Figure 7.6a, where events are labelled 1 to 8 right next to the
positive edges of $V_r$ and $V_o$, imagine that the PFD is originally in the 0
state. When the $V_o$ edge rises (event 1), according to Figure 7.5 the state
changes from 0 to -1. This is shown in the $d_{pd}$ trace of Figure 7.6a. Then
when the $V_r$ edge rises (event 2), according to Figure 7.5 the state changes
from -1 to 0. Hence $d_{pd}$ goes from 0 to -1 and back to 0 again. Notice that
the duration for which $d_{pd}$ stays at -1 measures the time between the edges
of $V_o$ and $V_r$ and is therefore linearly proportional to the phase
difference $\theta_e$ between $V_r$ and $V_o$. This is evident in the part of the
$d_{pd}$ trace that corresponds to event (1,2).

Fig. 7.6a An example illustrating the operation of a phase frequency detector
($q_{pd}$ is assumed to be reset at the beginning of each phase comparison;
similarly $q_{pd\_final}$ is assumed not to ...

Fig. 7.6b Plot of charge delivered and average current in a phase frequency
detector versus phase error (vertical axis: $q_{pd\_final}$, $\bar{i}_{pd}$;
horizontal axis: phase error in radians, from $-6\pi$ to $6\pi$)

Now we will see how to generate $q_{pd\_final}$ ($\bar{i}_{pd}$) for event (1,2)
and show that $q_{pd\_final}$ ($\bar{i}_{pd}$) is proportional to the duration
for which $d_{pd}$ is in the -1 state. First we reiterate that the signal
$d_{pd}$ is a logical variable that represents the PFD state and has three
states. Most logic circuits used today generate binary signals (1, -1); however,
the third state ($d_{pd}$ = 0) can be realized as a high-impedance state. The
circuitry within the dashed box in Figure 7.4 shows how the three states are
generated and how logic-to-charge (current) conversion is achieved. When the Up
signal is high, the PMOS transistor $M_2$ conducts, so the phase detector
current $i_{pd}$ is positive (pushing) and the charge delivered by the phase
detector to the capacitor C, $q_{pd}$, is positive. When the Down signal is
high, the NMOS transistor $M_1$ conducts, $i_{pd}$ is negative (sinking), and
$q_{pd}$ is negative. When neither signal is high, $i_{pd}$ is zero (high
impedance) and $q_{pd}$ is zero. Hence the signs of $q_{pd}$ and $i_{pd}$ follow
the sign of $d_{pd}$. A trace of $q_{pd}$ is shown in Figure 7.6a. Now let us
assume that when $M_1$ or $M_2$ is conducting, $i_{pd}$ is constant, with a
magnitude denoted $I_{charge\_pump}$. Then the final charge delivered to C,
$q_{pd\_final}$, is proportional to the time for which $M_1$ or $M_2$ is on.
Hence $q_{pd\_final}$ is proportional to the duration for which the phase
detector is on, which is denoted in (7.4b) as $\Delta T$. For event (1,2), since
$q_{pd}$ is negative, $q_{pd\_final}$ is also negative. We can then write

$$q_{pd\_final} = -I_{charge\_pump}\,\Delta T \tag{7.5}$$

Next we can replace $\Delta T$ by $-\theta_e T/2\pi$, where T is the period of
the reference signal $V_r$ (a constant) and $\theta_e$ is the phase difference
to be detected. There is a negative sign in front of $\theta_e$ because
$\Delta T$ is always positive while $\theta_e$ for event (1,2) is negative. With
this replacement (7.5) becomes

$$q_{pd\_final} = I_{charge\_pump}\,\frac{\theta_e T}{2\pi} \tag{7.6}$$

Looking at (7.6), we have shown that $q_{pd\_final}$ is proportional to
$\theta_e$, as stipulated in (7.2a), with

$$K'_{pd} = \frac{I_{charge\_pump}\,T}{2\pi} \tag{7.7}$$

Similarly, from the definition,

$$\bar{i}_{pd} = \frac{1}{T}\int_{T} i_{pd}\,dt = \frac{1}{T}\,I_{charge\_pump}\,\frac{\theta_e T}{2\pi} = I_{charge\_pump}\,\frac{\theta_e}{2\pi} \tag{7.8}$$

$\bar{i}_{pd}$ is shown in Figure 7.6a. From (7.8) it is seen that this
$\bar{i}_{pd}$ is also proportional to $\theta_e$. Comparing (7.8) with (7.4a)
we have

$$K_{pd} = \frac{I_{charge\_pump}}{2\pi} \tag{7.9}$$

Note that

$$K_{pd} = K'_{pd}/T \tag{7.10}$$
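The chain from edge timing to charge in (7.5) through (7.10) can be checked with
a short numerical sketch. The phase error is expressed in radians here, to match
the $2\pi$ in the derivation. The function names and the values of
$I_{charge\_pump}$ and T are hypothetical, and the pump current is assumed to be
perfectly constant while the detector is on.

```python
import math

def pfd_gains(i_charge_pump, t_ref):
    """Return (K'_pd, K_pd) from Eqs. (7.7) and (7.9), per radian."""
    k_pd_prime = i_charge_pump * t_ref / (2.0 * math.pi)   # coulomb/rad
    k_pd = i_charge_pump / (2.0 * math.pi)                 # A/rad
    return k_pd_prime, k_pd

def phase_error_rad(t_vo_edge, t_vr_edge, t_ref):
    """theta_e = theta_r - theta_o; negative when the V_o edge arrives first."""
    return 2.0 * math.pi * (t_vo_edge - t_vr_edge) / t_ref

def charge_from_edges(t_vo_edge, t_vr_edge, i_charge_pump):
    """Charge delivered while the pump is on between the two rising edges.

    If the V_o edge comes first, d_pd = -1 and current is sunk, Eq. (7.5).
    """
    delta_t = abs(t_vr_edge - t_vo_edge)          # on-time, always positive
    sign = -1.0 if t_vo_edge < t_vr_edge else 1.0
    return sign * i_charge_pump * delta_t

# Hypothetical event (1,2): V_o edge at t = 0, V_r edge 0.5 us later,
# 200 kHz reference (T = 5 us), 100 uA charge pump current.
i_cp, t_ref = 100e-6, 5e-6
k_pd_prime, k_pd = pfd_gains(i_cp, t_ref)
theta_e = phase_error_rad(0.0, 0.5e-6, t_ref)
q_edges = charge_from_edges(0.0, 0.5e-6, i_cp)
i_avg = q_edges / t_ref                           # average over one cycle
print(theta_e, q_edges, k_pd_prime * theta_e, i_avg, k_pd * theta_e)
```

The charge computed from the edge times agrees with $K'_{pd}\theta_e$, and the
cycle-averaged current agrees with $K_{pd}\theta_e$, as (7.6) and (7.8) require.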

The current sources $M_1$ and $M_2$ are collectively denoted as the
logic-to-charge (current) converter, more popularly known as the charge pump. It
serves to convert the logic variable $d_{pd}$ to a signal variable $i_{pd}$.
Capacitor C (the simplest example of the loop filter) then converts $i_{pd}$ to
$q_{pd\_final}$. Both the charge pump and the loop filter are discussed in
further detail later.

Having explained how the charge pump generates $q_{pd\_final}$ ($\bar{i}_{pd}$)
for event (1,2), it is helpful to go back to Figure 7.6a to illustrate that it
works equally well for other events. If we progress to event 3, $V_r$ comes
along first this time. According to Figure 7.5, the state goes from 0 to 1. On
event 4, when $V_o$ comes along, the state goes back to 0. Consequently,
$d_{pd}$ goes up and then down. Again it is seen that $q_{pd\_final}$ assumes a
value that is proportional to the phase difference. Notice that in the present
case, with a positive phase difference (defined as phase of $V_r$ minus phase of
$V_o$), the PFD outputs a positive $d_{pd}$. Therefore, this PFD measures both
the magnitude and the sign of the phase difference. This is highlighted in
Figure 7.6b, which plots $q_{pd\_final}$ and $\bar{i}_{pd}$ as a function of
$\theta_e$. From Figure 7.6b it is seen that $q_{pd\_final}$ becomes largest when
the phase error is positive and approaches 360°. What happens beyond $2\pi$? The
PFD behaves as if the phase error recycled at zero. Why does this happen? An
easy answer is that the state machine whose state diagram is described in
Figure 7.5 cannot distinguish between $\theta_e$ and $\theta_e + 360°$. An
example of this is shown in Figure 7.6a, when we trace the value of $d_{pd}$
during events (1,2) and events (1,3). If we look at the phase errors
corresponding to these two events (as shown in Figure 7.6a),
$\theta_e(1,3) = \theta_e(1,2) + 360°$. This is because edge 3 is exactly one
period delayed from edge 2. Now if we look at $d_{pd}$ in the same figure, we
see that $d_{pd}$ does not change as we move from edge 2 to edge 3. It stays at
0. Hence $q_{pd\_final}$ for event (1,2) equals $q_{pd\_final}$ for event (1,3).
In Figure 7.6b, this corresponds to points A and B on the transfer curve:
$\theta_e$ differs by 360° but $q_{pd\_final}$ is the same. Basically, the PFD
slips a cycle. Therefore, the $q_{pd\_final}$ versus $\theta_e$ curve described
in Figure 7.2c repeats itself (like a sawtooth) and the relationship is no
longer linear (it is periodic). This is shown in Figure 7.6b. This nonlinear
behaviour of the PFD is denoted as cycle slipping, and hence the PLL is a
nonlinear system. In this chapter we assume that we always work within the 360°
region (which is true for a frequency synthesizer) and therefore we assume that
the PLL is linear.

Other phase relationship possibilities are depicted in events 5-8, which are
self-explanatory if we go through Figure 7.5. One interesting scenario is event
8, when $V_r$ and $V_o$ have exactly the same phase. This is an ill-defined
situation, and what happens depends on the internal delay of the logic. There
will be some transients but $d_{pd}$ always settles back to state 0. When in
lock, $V_r \approx V_o$. Imagine for the time being that $V_r$ has the same
frequency as $V_o$ but is slightly leading in phase; $d_{pd}$ therefore consists
of a narrow pulse train at frequency $f_r$. This pulse train generates spurious
components (spurs) and has a serious impact on the final spectral purity of the
frequency synthesizer.
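The sawtooth transfer characteristic of Figure 7.6b and the cycle slipping
discussed above can be sketched as an idealized function of the phase error. The
model below is an assumption made for illustration: it takes the characteristic
to be linear inside $(-2\pi, 2\pi)$, to recycle to zero at every multiple of
$2\pi$, and to be odd-symmetric for negative phase errors. The function name is
hypothetical.

```python
import math

def pfd_charge(theta_e_rad, k_pd_prime):
    """Idealized PFD sawtooth: q_pd_final = K'_pd * theta_e, wrapped every 2*pi.

    math.fmod keeps the sign of theta_e, so negative phase errors give
    negative charge and wrap in the same way as positive ones.
    """
    return k_pd_prime * math.fmod(theta_e_rad, 2.0 * math.pi)

k = 1.0  # normalized K'_pd, so the output reads directly as wrapped phase

# Inside the linear region the output follows the phase error.
print(pfd_charge(1.0, k), pfd_charge(-1.0, k))
# One full cycle further out, the detector reports the same value (cycle slip).
print(pfd_charge(1.0 + 2.0 * math.pi, k), pfd_charge(-1.0 - 2.0 * math.pi, k))
# Just below 2*pi the output is near its maximum; just above, it is near zero.
print(pfd_charge(2.0 * math.pi - 0.01, k), pfd_charge(2.0 * math.pi + 0.01, k))
```

The abrupt drop at the $2\pi$ boundary is the discontinuity that makes the PFD,
and hence the PLL, nonlinear outside the 360° region.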

Finally, let us point out that the PFD operates as an asynchronous (or
fundamental-mode) digital circuit and is therefore subject to the usual problems
of asynchronous circuits, such as hazards. Also, the setup time and the hold
time need to be investigated to see (1) whether they will be violated and (2) if
they are violated, what impact this has on the operation of the PFD.
Fortunately, since the PFD for a frequency synthesizer typically works at a
rather low frequency, the preceding constraints do not usually pose a problem.

To  highlight  the  advantages  of  the  PFD,  we  next  review  another  phase  detector 
based  on  the  combinational  circuit:  the  EXOR  gate. 


7.3.2    EXOR  Phase  Detector 


Figure 7.7a shows the EXOR gate-based PD [14], and Figure 7.7b depicts the
waveforms of the EXOR PD for different $\theta_e$. Let us show how the EXOR PD
differs from the PFD. First, $d_{pd}$ has only two states, -1 and 1, as opposed
to the three states available in a PFD. From the waveforms it can be seen that
initially $V_o$'s phase leads $V_r$'s. Since $V_r$ is at a higher frequency than
$V_o$, eventually $V_r$'s phase leads $V_o$'s. Notice that upon comparing events
(1,2) and (3,4), $d_{pd}$ is the same, which means that the EXOR gate cannot
tell which edge is leading. Also, from Figure 7.7c, $q_{pd\_final}$ is exactly 0
when $\theta_e$ is exactly 90° (it does not matter which wave is leading) rather
than when $\theta_e = 0$. Comparing Figure 7.6a and Figure 7.7b, it is seen that
$d_{pd}$ changes twice as fast here. Therefore, following the same argument as
for the PFD, when the PLL is in lock the present PD generates a spur at twice
the frequency $f_r$. In lock, the duty cycle of $d_{pd}$ is exactly 50%.

Having highlighted the differences between the two detectors, let us go through
the EXOR circuit in more detail by following Figure 7.7b. Initially $V_r$ is
lagging $V_o$ [event (1,2)], $\theta_e$ is negative, and $d_{pd}$ stays in the
-1 state more often than in the 1 state. Hence $M_2$ is on more often than
$M_1$. Therefore, more charge is sunk from C through $M_2$ to ground than is
sourced to C through $M_1$ from $V_{dd}$. Accordingly, $q_{pd\_final}$ is
negative. As time progresses, $V_r$ starts to lead $V_o$ [event (3,4)]. We may
expect $q_{pd\_final}$ to be positive. However, if we focus on $d_{pd}$ in
Figure 7.7b during event (3,4), we quickly find that $q_{pd\_final}$ is still
negative. This is because, as discussed previously, $q_{pd\_final}$ is exactly 0
when the phase difference is exactly 90°, so there is an offset (note that in
the PFD case there is no such offset). In the present case, since $V_r$ does not
quite lead $V_o$ by 90°, $q_{pd\_final}$ is still negative, as shown in
Figure 7.7c. Eventually $\theta_e$ is positive and larger than 90° [event
(4,5)]. In this case $q_{pd\_final}$ becomes positive, as shown in Figure 7.7c.
Consequently, the segment from $\theta_e$ = 0 to 180° is the same as in the PFD
case except that it is offset along both the $\theta_e$ axis (by 90°) and the
$q_{pd\_final}$ axis (by $q_{pd\_final}|_{\theta_e = 0°}$). What happens when
$\theta_e$ increases beyond $\pi$ radians? As shown in event (6,7) of
Figure 7.7b, the duty cycle of $d_{pd}$ starts to decrease and hence
$q_{pd\_final}$ also starts to decrease, as drawn in Figure 7.7c. This continues
until $q_{pd\_final}$ decreases to its minimum value. Note that this is
different from Figure 7.6b. In Figure 7.7c, we have a triangular plot as opposed
to the sawtooth plot in Figure 7.6b. Next, what happens as $\theta_e$ increases
beyond $2\pi$ radians? Exactly the same phenomenon as in the PFD case happens.
Since the PD cannot distinguish between $\theta_e$ and $\theta_e + 360°$, the PD
slips a cycle. That is, the transfer characteristic repeats itself. Notice that
the same repetition occurs along the negative $\theta_e$ axis (that is, if
$\theta_e$ goes below 0°). On the other hand, the PFD behaves differently with a
negative $\theta_e$.

Fig. 7.7a Phase detector based on EXOR gate

Fig. 7.7b An example illustrating the operation of a phase detector based on
EXOR gate

Fig. 7.7c Plot of charge delivered and average current in a phase detector based
on EXOR gate versus phase error
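The triangular characteristic of Figure 7.7c can be sketched by computing the
duty cycle of $d_{pd}$ as a function of phase error. The model below assumes
ideal square-wave inputs with exactly 50% duty cycle and zero rise and fall
times, and a charge pump that sources or sinks the same constant current; the
function name is hypothetical.

```python
def exor_avg_current(theta_e_deg, i_charge_pump):
    """Average EXOR PD output current over one cycle for ideal square waves.

    The fraction of time d_pd = 1 rises linearly from 0 at 0 deg to 1 at
    180 deg, then falls back to 0 at 360 deg, repeating every 360 deg.
    """
    phase = theta_e_deg % 360.0        # Python's % maps negatives into [0, 360)
    if phase <= 180.0:
        duty = phase / 180.0
    else:
        duty = (360.0 - phase) / 180.0
    # d_pd is +1 for `duty` of the cycle and -1 for the remainder.
    return i_charge_pump * (2.0 * duty - 1.0)

i_cp = 1.0  # normalized pump current
for theta in (0.0, 45.0, 90.0, 180.0, 270.0, 450.0, -90.0):
    print(theta, exor_avg_current(theta, i_cp))
```

The output is zero at 90° rather than at 0°, it is still negative when $V_r$
leads by less than 90°, and it is continuous at the cycle boundaries, in contrast
to the PFD sawtooth.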

Despite all of these differences, it seems that the EXOR PD is not fundamentally
inferior to the PFD. Its major disadvantage, however, is that it is very
sensitive to waveform symmetry. It is sensitive to both the duty cycle and the
finite rise/fall time of $V_r$ and $V_o$. From event (6,7) in Figure 7.7b, it is
clear that $d_{pd}$ depends not only on the rising edges but on the falling
edges of $V_r$ and $V_o$ as well (not so in the PFD); that is, on the duty
cycle. Figure 7.8a and Figure 7.8b further highlight the dependency on finite
rise and fall times. In Figure 7.8a,b, we have two identical inputs ($V_r$,
$V_o$) to the EXOR PD, except that in Figure 7.8a the $V_r$ waveform has a
finite rise and fall time. Right next to each edge we attach an edge number.
Comparing $d_{pd}$ in Figure 7.8a and Figure 7.8b, they can be seen to differ
because of this finite rise/fall time of $V_r$. Specifically, because of the
finite rise time in Figure 7.8a, edge 1 of $d_{pd}$ is delayed compared with
edge 1 of $d_{pd}$ in Figure 7.8b (edge 1 comes along only when $V_r$ in
Figure 7.8a crosses the logic threshold). Therefore, in Figure 7.8a, the on
duration of the (1,2) pulse of $d_{pd}$ is shorter than that of its counterpart
in Figure 7.8b. Similarly, because of the finite fall time in Figure 7.8a, edge
3 of $d_{pd}$ happens before edge 3 of $d_{pd}$ in Figure 7.8b. Therefore, in
Figure 7.8a, the on duration of the (3,4) pulse of $d_{pd}$ is longer than that
of its counterpart in Figure 7.8b.

This dependency on the actual waveform (not just the edges) runs contrary to
what a true phase detector is supposed to respond to: edge timing only. On
closer look we can see why. An EXOR PD, and for that matter any combinational
logic-based PD, is really not a phase detector but a duty-cycle detector. It so
happens that if the rise/fall time of the waveform is zero, there is a linear
relationship between the duty cycle of a waveform and its phase, making this
circuit useful in detecting phase.

Exactly the opposite is true for a sequential logic-based PD (of which the PFD
is one type; the JK-FF based PD is another popular one). Here, since the PD
responds to an edge, there is no waveform dependency. The difference between
responding to duty cycle and responding to edge timing also helps to explain one
of the differences observed earlier in the $q_{pd\_final}$ versus $\theta_e$
curves for these two types of phase detector. For the combinational logic-based
PD, since the duty cycle changes continuously at the cycle boundaries [e.g., at
$\theta_e = 2\pi$ in Figure 7.7c], $q_{pd\_final}$ also changes continuously with
$\theta_e$ at these cycle boundaries. Because the duty cycle changes
continuously, only the slope (or $\partial q_{pd\_final}/\partial \theta_e$)
changes abruptly at these boundaries. The same cannot be said for a sequential
logic-based detector. For a sequential logic-based PD at the boundaries [e.g.,
at $\theta_e = 2\pi$ in Figure 7.6b], $q_{pd\_final}$ also changes abruptly,
because once you miss the edge, you miss the whole cycle and start all over
again. This is exactly what was observed before as a difference between
Figure 7.7c and Figure 7.6b at these boundaries.

Fig. 7.8 Sensitivity of EXOR gate based phase detector to finite rise, fall
time. (a) $d_{pd}$ with finite rise, fall time in $V_r$ (b) $d_{pd}$ with zero
rise, fall time in $V_r$


7.3.3     Charge  Pump 


As shown previously, the PD (both PFD and EXOR based) needs a logic-to-charge
(current) converter, or charge pump. This charge pump does one more thing: it
realizes integration without an op-amp (or any other active device). We now
consider some unique features of this charge pump. To do so it is helpful to
develop a generic model of the charge pump, which is shown in Figure 7.9 [22].
In general, the capacitor C can be replaced by a more complex loop filter. Most
of the time the three output states are used with a PFD, but it is also possible
to have combinational phase detectors with three-state logic outputs. These
types of PD, however, do not have the frequency-detection property of the PFD.
One unique feature (and major advantage) of having a three-state output, like
the one in the charge pump, is the existence of an idle state, which occurs when
the PLL is in lock. This idling means that the output is zero and ideally no
spur exists. Therefore, the VCO is not modulated by a spur and $V_o$ does not
have sidetones due to frequency modulation. In practice, there is voltage
ripple, which causes spurs to occur at frequency $f_r$. This comes about because
there is always a small amount of charge that needs to be supplied to compensate
for charge leaking away. This charge leakage is due to mismatch between current
sources $I_p$ and $I_n$.

Fig. 7.9 Generic model of a charge pump

Another  feature  of  any  charge  pump  is  its  time-varying  properties.  Because  of 
the  switching  inherent  in  the  charge  pump,  the  PLL  is  a  time-varying  network  and  is 
best  analyzed  as  a  discrete  time  circuit.  One  complication  about  analyzing  the 
charge  pump  circuit  as  a  discrete  time  circuit  is  the  fact  that  the  sampling  operation 
in  a  PFD/charge  pump  is  nonuniform.  This  is  because  the  V0's  edge  can  arrive  at  the 
input  to  the  PFD  any  time  (nonuniformly),  in  particular  during  the  acquisition 
phase.  When  the  system  is  locked,  it  becomes  uniform  sampling  only  when  the 
reference  frequency  is  constant,  which  fortunately  is  the  case  in  the  present 
application.  Hence,  we  can  treat  the  PLL  as  a  discrete  time  circuit  with  uniform 
sampling  once  it  is  in  lock.  Further  simplification  in  analysis  is  possible  if  we 
observe that $V_{lf}$ varies by a very small amount on each cycle of the reference signal
(in other words, loop bandwidth is small compared with the reference frequency).
Consequently,  the  uniform  time-varying  operation  can  be  averaged  and  the 
discrete  time  circuit  is  transformed  into  a  continuous  time  (time  invariant)  network, 
as  in  the  subsequent  analysis. 


7.3.4     Spur 

7.3.4.1     Impact 

As  stated  previously,  spurious  frequency  components,  which  we  have  denoted  as 
spurs,  at  one  time  (for  PFD)  or  two  times  (for  EXOR  PD)  the  reference  frequency 
can  be  generated.  These  spurs  will  be  filtered  by  the  loop  filter.  However,  though 
attenuated,  the  remaining  spurs,  if  strong  enough,  when  applied  to  the  VCO  will 
phase  modulate  the  VCO  and  generate  undesired  sidetones.  These  sidetones  can 
mix  down  unwanted  adjacent  channel  interference  (or  other  unwanted  signals)  to 
the  same  IF  as  the  desired  signal  and  corrupt  the  desired  signal. 


7.3.4.2     Origin  and  Calculation 

Let us refer to Figure 7.4 for the time being and assume that we have a fixed steady-state phase error $\theta_e$ that is positive. This means that $V_r$ is always on for a fixed period of time before $V_o$ is on. One such example is event (3, 4) in Figure 7.6a. Here it is seen that $d_{pd}$ is 1 for the $\frac{\theta_e}{2\pi}T$ period and 0 for the rest of the period. This part of the $d_{pd}$ trace is redrawn in Figure 7.10a. The corresponding $i_{pd}$ waveform is shown in Figure 7.10b, where it is seen that the charge pump delivers a current $i_{pd} = I_{charge\_pump}$ for a $\frac{\theta_e}{2\pi}$ fraction of a period $T$ (where $T = 1/f_r$, $f_r$ being the reference frequency). For the rest of the period $i_{pd} = 0$. Now where does this steady-state phase error $\theta_e$ come from? The answer lies in the mismatch between the top and bottom transistors $M_2$, $M_1$ in Figure 7.4. Referring again to Figure 7.10a and Figure 7.10b, we assume that $i_{pd}(t)$ has a duty cycle $d = \frac{\theta_e}{2\pi}$. For typical technology and transistor sizing, $d$ turns out to be on the order of 0.1%. If we take a Fourier series expansion of $i_{pd}(t)$, it follows that the spectrum of the waveform, denoted as $i_{pd}(f)$, consists of discrete frequency components spaced $f_r$ apart, as shown in Figure 7.11. These are the spurs. The envelope of these frequency components follows a $\sin(f)/f$ function. The dc component is given by $dc = \overline{i_{pd}} = d \times I_{charge\_pump}$. The spur is represented by the frequency component closest to dc, as that component is most damaging. This is denoted as $i_{spur}$ and is given by $A_{spur}\sin\omega_r t$. Since $f_r$ is close to dc, then $A_{spur} \approx i_{pd}(f)|_{f=0}$ and so $A_{spur} \approx \overline{i_{pd}}$. Consequently,


$$i_{spur} = \overline{i_{pd}}\,\sin\omega_r t = d \times I_{charge\_pump}\,\sin\omega_r t \qquad (7.11)$$
Fig. 7.10 (a) Plot of steady-state $d_{pd}$ as a function of time, (b) Plot of steady-state phase detector
current  as  a  function  of  time 
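The claim that the component at $f_r$ is nearly as large as the dc term for a very small duty cycle can be checked numerically. The sketch below is illustrative only: it assumes an ideal rectangular pulse train, uses the two-sided Fourier series coefficient magnitude (a single-sided amplitude would be twice as large), and picks a hypothetical 1 mA charge pump current. The function name `pulse_train_coefficient` is not from the text.

```python
import math

def pulse_train_coefficient(k, duty, i_peak):
    """Magnitude of the k-th two-sided Fourier series coefficient (k >= 0)
    of a rectangular pulse train with the given duty cycle and peak current."""
    if k == 0:
        return duty * i_peak  # dc component, d * I_charge_pump
    return i_peak * abs(math.sin(math.pi * k * duty)) / (math.pi * k)

duty = 1e-3            # d of about 0.1 %, as quoted in the text
i_charge_pump = 1e-3   # assumed 1 mA, purely illustrative

dc = pulse_train_coefficient(0, duty, i_charge_pump)
first = pulse_train_coefficient(1, duty, i_charge_pump)
print(f"dc component        : {dc:.6e} A")
print(f"component at f_r    : {first:.6e} A")
print(f"ratio (f_r over dc) : {first / dc:.6f}")
```

For duty cycles this small the sin(x)/x envelope is essentially flat near dc, which is why the text approximates $A_{spur}$ by the dc value.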


We may choose to represent spur in the phase rather than in the amplitude domain. Here we denote spur in the phase domain as $\theta_{spur}(t)$, and this is given by

$$\theta_{spur}(t) = d \times 2\pi \times \sin\omega_r t \qquad (7.12)$$


To see the impact of such a sinusoidal perturbation of $i_{spur}$ or $\theta_{spur}$ on the output frequency, we start with Figure 7.2a, inject the spur, and redraw the resulting diagram in the S-domain as shown in Figure 7.12. Now why is the spur represented by $i_{spur}(s)/K_{pd}$ in Figure 7.12? Dividing (7.11) by (7.12), we have

$$\frac{i_{spur}(t)}{\theta_{spur}(t)} = \frac{I_{charge\_pump}}{2\pi} \qquad (7.13)$$


Substituting (7.9) into (7.13), we have

$$i_{spur}(t) = K_{pd}\,\theta_{spur}(t)$$
Fig. 7.12 Block diagram of a PLL with spur sources highlighted


Taking the Laplace transform and rearranging, we have:

$$\theta_{spur}(s) = \frac{i_{spur}(s)}{K_{pd}} \qquad (7.14)$$


Hence the spur, $\theta_{spur}(s)$, is indeed represented by $i_{spur}(s)/K_{pd}$, which explains the representation in Figure 7.12.

Continuing with our discussion on the impact of $i_{spur}$, let us now apply a sinusoidal $i_{spur}$ at the phase detector. This would result in a sinusoidal phase change at the output that equals $\theta_o$.

In the S-domain $\theta_o$ is given by

$$\theta_o(s) = H_{pd}(s)\,\theta_{spur}(s) = H_{pd}(s)\,\frac{i_{spur}(s)}{K_{pd}} = H_{ref}(s)\,\frac{i_{spur}(s)}{K_{pd}} \qquad (7.15)$$
Here $H_{pd}$, $H_{ref}$ are the transfer functions from the PD and reference source to the synthesizer output, respectively. Notice that since the PD is located at the same node as the reference source, $H_{pd} = H_{ref}$, as indicated in the last equality. From now on we will use $H_{ref}$ to represent $H_{pd}$.

The $\theta_o$ would phase modulate the VCO. Let us assume that the VCO output before any spur is injected is given as $V_o = (A/2)\sin\omega_o t$. Here $A/2$ is the amplitude of the carrier. If we assume that the phase modulation is narrowband modulation (with the modulation index being small, which is almost always the case as $\theta_{spur}$, $i_{spur}$ is small), then $V_o$ will consist of the original frequency component at $f_o$ and two new sidetones at $f_o \pm f_r$ [15]. Mathematically, $V_o$ is given as

$$V_o = \frac{A}{2}\sin(\omega_o t) + \frac{A\theta_o}{4}\sin((\omega_o + \omega_r)t) + \frac{A\theta_o}{4}\sin((\omega_o - \omega_r)t) \qquad (7.16)$$

The two new sidetones constitute spurs at the output. The ratio of the amplitudes of these spurs to the amplitude of the carrier is obtained from (7.16) as

$$\frac{\mathrm{spur\_amplitude}}{\mathrm{carrier\_amplitude}} = \pm\frac{1}{2}\theta_o \qquad (7.17a)$$

and

$$\frac{\mathrm{spur\_power}}{\mathrm{carrier\_power}} = \left(\frac{1}{2}\theta_o\right)^2 \qquad (7.17b)$$

where $\theta_o$ has the dimension of rad and is a ratio. If we look at $\theta_o$ and take the log of it, then the ratio has the dimension of dBc.
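As a quick numerical illustration of (7.17), the following sketch converts a peak output phase deviation into a spur level in dBc. The function name `spur_dbc` and the sample values of $\theta_o$ are assumptions chosen for illustration; they are not taken from the text.

```python
import math

def spur_dbc(theta_o_rad):
    """Spur level relative to the carrier, using the amplitude ratio
    theta_o / 2 from (7.17a)."""
    return 20.0 * math.log10(theta_o_rad / 2.0)

for theta in (1e-2, 1e-3, 1e-4):   # assumed example phase deviations, in rad
    print(f"theta_o = {theta:g} rad -> {spur_dbc(theta):.1f} dBc")
```

Each tenfold reduction in $\theta_o$ lowers the spur by 20 dB, which is the scaling implied by (7.17b).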

From (7.17) we can see that to find spur amplitude/power we need $\theta_o$. In turn, from (7.15), we need the reference transfer function $H_{ref}$. This transfer function depends on the loop filter transfer function $F(s)$. Hence we will postpone any further discussion on how to calculate spur amplitude/power until after the loop filter section in Chapter 8.


7.3.5     Kpd 

We have explained the operation of the PD/charge pump using both $q_{pd\_final}$ ($K_{pd}'$) and $i_{pd}$ ($K_{pd}$). For design purposes, it is easier to deal with current rather than charge and so from now on we will characterize the PD/charge pump with $i_{pd}$ and $K_{pd}$. Here are the considerations for determining $K_{pd}$:

1. $K_{pd}$ should not be so large that the transistors $M_1$, $M_2$ in Figure 7.4 or Figure 7.7a become excessively large.

2. Phase noise considerations. We will jump ahead of ourselves a little bit and extend the definition of the synthesizer output phase noise due to VCO phase noise, as defined in (7.119), to the definition of the synthesizer output phase noise due to PD phase noise, $S_{\theta_o\_pd}$:

$$S_{\theta_o\_pd} = |H_{ref}|^2\, S_{\theta\_pd} \qquad (7.18)$$

Here $S_{\theta\_pd}$ is the phase noise of the PD. If we assume that the PD phase noise comes from the charge pump, then $S_{\theta\_pd}$ can be related to the PSD of the equivalent input current noise of the charge pump, as given by $\overline{i_{nd}^2}$. The relationship can be derived by the same procedure that leads to (7.14) and is given as

$$S_{\theta\_pd} = \frac{\overline{i_{nd}^2}}{K_{pd}^2} \qquad (7.19)$$

Substituting (7.19) into (7.18), we have

$$S_{\theta_o\_pd} = |H_{ref}|^2\, \frac{\overline{i_{nd}^2}}{K_{pd}^2} \qquad (7.20)$$

The input here is $\overline{i_{nd}^2}$, which has the dimension of $\mathrm{A^2/Hz}$. $S_{\theta_o\_pd}$ has the dimension of dBc/Hz.

As will be shown in Chapter 8, $\overline{i_{nd}^2}$ is only weakly dependent on $I_{charge\_pump}$. From (7.9) this means that $\overline{i_{nd}^2}$ is also only weakly dependent on $K_{pd}$. Hence from (7.20) $S_{\theta_o\_pd}$ depends roughly on the inverse of $K_{pd}^2$. Accordingly, $K_{pd}$ should be selected large enough to make any output phase noise due to the charge pump insignificant when compared with other sources. The exact value, of course, depends on $H_{ref}$. $H_{ref}$ again depends on the loop filter. We will postpone further discussion until the loop filter section in Chapter 8. After that discussion in Chapter 8, we will have a chance to use (7.18) through (7.20) to determine $K_{pd}$. Once $K_{pd}$'s value is picked, we can use the following equation to design $I_{charge\_pump}$:

$$K_{pd} = \frac{I_{charge\_pump}}{2\pi} \qquad (7.21)$$

Then we can determine the size of $M_1$ and $M_2$.
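The two relations used above, the $1/K_{pd}^2$ dependence in (7.20) and the link between $K_{pd}$ and the charge pump current, can be sketched as follows. This is a minimal illustration under the stated assumption that the charge pump current noise and $H_{ref}$ do not change when $K_{pd}$ is scaled; the function names are hypothetical.

```python
import math

def charge_pump_current(k_pd):
    """I_charge_pump in amperes for a chosen K_pd in A/rad,
    from K_pd = I_charge_pump / (2*pi)."""
    return 2.0 * math.pi * k_pd

def pd_noise_change_db(k_pd_ratio):
    """Change in the PD contribution to output phase noise (dB) when K_pd is
    scaled by k_pd_ratio, assuming current noise and H_ref stay fixed."""
    return -20.0 * math.log10(k_pd_ratio)

print(charge_pump_current(1e-4))   # current needed for K_pd = 0.1 mA/rad
print(pd_noise_change_db(2.0))     # doubling K_pd lowers the noise by about 6 dB
```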


7.4    Dividers 

7.4.1     Survey  of  Different  Types  of  Divider 

We begin our discussion by showing some examples of dividers in terms of increasing complexity (Figure 7.13) [19]. Referring back to Figure 7.1, we note that the input to the divider is denoted as $f_o$ ($V_o$) and the output is denoted as $f_{divide}$ ($V_{divide}$). This notation is carried over to Figure 7.13. Figure 7.13a shows an indirect divider where the waveform is assumed to be sinusoidal. Let us assume that the output of the mixer has a frequency component $f_{divide}$. Hence the output of the BPF has a frequency at $f_{divide}$ as well. When multiplied, this becomes $f_{divide} \times (N - 1)$. If we apply this signal to the mixer, the mixer output has a frequency component at $f_o - f_{divide} \times (N - 1)$, which we assumed previously is $f_{divide}$. Equating these two expressions, we can solve for $f_{divide}$, which is given as $f_{divide} = f_o/N$, and division is achieved as desired. Notice that in the present case, division is performed by using a multiplier (hence the term INDIRECT DIVISION). Due to feedback, this divider is slow.

Fig. 7.13a Indirect divider

Fig. 7.13b (i) T-FF based divide by 2 divider, (ii) T-FF implemented using D-FF
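The algebra behind the indirect divider can be written out explicitly. This is only a restatement of the equating step described above, using the same symbols:

```latex
\begin{align*}
f_{divide} &= f_o - (N-1)\, f_{divide} \\
N\, f_{divide} &= f_o \\
f_{divide} &= \frac{f_o}{N}
\end{align*}
```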

Figure 7.13b shows a faster divider that is based on digital circuits. This time the input is assumed to be a square wave. Let us assume that the T-FF (toggle FF) is positive edge triggered. Accordingly, this FF has its output level toggled every time a positive edge arrives. Then it is easy to see that for every two positive edges in the input waveform, there is one positive edge in the output waveform, and hence the number of positive edges is reduced by a factor of 2 (divide by 2). Figure 7.13b(ii) shows how this T-FF can be easily implemented by connecting the inverted output ($\overline{Q}$) of a D-FF back to its D input. Figure 7.13c has two of these cascaded. In general, we can also employ feedback to obtain a non-binary division ratio.

Fig. 7.13c T-FF based divide by 4 divider and the corresponding waveform

Fig. 7.13d Input, output waveform of a generic divider that divides edges non-uniformly
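The edge-counting view of the T-FF dividers can be illustrated with a small behavioral model. This is a sketch at the logic level only (no timing or analog effects); the class `ToggleFlipFlop` and helper `count_rising_edges` are illustrative names, not part of the text.

```python
class ToggleFlipFlop:
    """Positive-edge-triggered T-FF: the output toggles on each rising edge."""
    def __init__(self):
        self.q = 0
        self._prev_clk = 0

    def clock(self, clk):
        if clk == 1 and self._prev_clk == 0:   # rising edge detected
            self.q ^= 1
        self._prev_clk = clk
        return self.q

def count_rising_edges(samples):
    """Count 0 -> 1 transitions in a list of logic levels (starting from 0)."""
    prev, edges = 0, 0
    for s in samples:
        if s == 1 and prev == 0:
            edges += 1
        prev = s
    return edges

# Square-wave input with 16 rising edges.
v_in = [0, 1] * 16

ff1, ff2 = ToggleFlipFlop(), ToggleFlipFlop()
v_ff1 = [ff1.clock(v) for v in v_in]     # one T-FF: divide by 2
v_div = [ff2.clock(v) for v in v_ff1]    # two cascaded T-FFs: divide by 4

print(count_rising_edges(v_in), count_rising_edges(v_ff1), count_rising_edges(v_div))
```

Cascading one more stage halves the edge count again, which is why a chain of T-FFs gives only power-of-two ratios unless feedback is added.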

Let  us  now  do  some  comparisons  among  Figure  7.13a,  b,  and  c.  It  is  seen  that  the 
divider  of  Figure  7.13a  divides  the  input  frequency  by  two,  but  that  in  Figure  7.13b 
it  divides  the  input  edges  by  2.  In  general,  for  a  frequency  synthesizer  the  divider 
only  needs  to  divide  down  the  number  of  input  edges,  and  so  the  scheme  in 
Figure  7.13b  is  more  general.  Taking  this  one  step  further,  theoretically  we  do 
not  even  have  to  divide  down  the  input  edges  uniformly.  For  example,  a  divider  that 
divides  according  to  Figure  7.13d  will  also  work.  This  divider  essentially  produces 
four  output  positive  edges  for  eight  positive  input  edges,  and  therefore  achieves  a 
divide  by  2  operation,  although  in  an  average  sense.  Along  this  line  of  thinking,  it  is 
necessary  to  revisit  the  concept  of  division  and  classify  the  dividers  accordingly,  as 
will be done in the following section. Finally, to have a variable division ratio, the divider can be implemented as shown in Figure 7.13e, as a programmable counter [20]. This divider is essentially a counter that counts from the preloaded input $M$ to 0, thereby achieving a division of $M$. Here $M \le M_{max}$, the modulus of the counter, and so the maximum division ratio is $M_{max}$. This counter can be implemented either as a ripple or a synchronous counter. As a side note, unless otherwise specified, from now on the terms divider, prescaler, and counter are used synonymously [20].

Fig. 7.13e A programmable preload input
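A behavioral sketch of such a preloadable down counter is given below. It assumes one particular convention, namely that the counter is reloaded with M when it reaches 0 so that one output edge is produced for every M input edges; the actual reload timing in Figure 7.13e may differ. The function name is hypothetical.

```python
def programmable_divider(num_input_edges, m):
    """Return the input-edge indices (1-based) at which an output edge occurs
    for a down counter that is preloaded with m."""
    if m < 1:
        raise ValueError("division ratio must be at least 1")
    output_edges = []
    count = m                      # preload
    for edge in range(1, num_input_edges + 1):
        count -= 1                 # one count per input edge
        if count == 0:
            output_edges.append(edge)
            count = m              # reload for the next cycle
    return output_edges

print(programmable_divider(24, 5))   # output edges at input edges 5, 10, 15, 20
```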

The  preceding  discussion  gives  examples  of  various  types  of  dividers.  To  be 
more  rigorous  and  generic,  let  us  do  the  following  classifications:  fixed  divider  and 
programmable  divider  [20]. 

A  fixed  divider  means  that  the  divisor  (or  dividing  ratio)  is  fixed  and  cannot  be 
changed.  Examples  are  shown  in  Figure  7.13b,  c.  This  divider  has  the  inherent 
advantage  that  since  the  configuration  is  fixed,  the  critical  path  is  fixed  and  we  can 
optimize  speed  along  this  critical  path.  This  is  because  for  the  early  stage  that  carries 
out  division  on  the  most  significant  bit  (MSB),  the  input  signal  is  faster  and  therefore 
the size and current (hence power) should be designed to make it run faster. Alternately,
we can see that the critical path from the stage for the MSB back to the stage for
the least significant bit (LSB) is fixed and thus can be optimized.

As  the  name  suggests,  a  programmable  divider  has  a  programmable  divisor. 
As  stated  previously,  this  can  be  achieved  by  using  a  programmable  counter,  as 
shown  in  Figure  7.13e.  For  a  given  speed  requirement  a  programmable  divider 
is  less  power  optimized.  This  is  so  because  with  programmability  the  critical  path  is 
dependent  on  the  loaded  value.  In  the  example  of  Figure  7.13e,  the  critical  path 
is  dependent  on  the  value  M  and  hence  the  designer  cannot  optimize  the  transistor 
sizing  a  priori. 

In general, a frequency synthesizer does need a programmable division ratio while maintaining low power. To meet this challenge, we can use the architecture shown in Figure 7.14a. Here the output frequency $f_o$ is divided by a fixed counter at high frequency. Since the first counter is operating at high frequency, it consumes a lot of power. Since it is a fixed counter, its power can be optimized. To make the entire division ratio programmable, the second counter must be programmable. Since it is programmable, it cannot be optimized for power. However, since it is operating at low frequency, even without power optimization, the power consumption is not prohibitive. Selecting $f_{intermediate}$ involves a proper trade-off between the relative power consumption of the first and second counters.

Fig. 7.14a A complete divider consisting of a fixed divider cascaded with a programmable divider

Even though the aforementioned divider achieves better overall power consumption than a single programmable counter, it does reduce the resolution when compared with the single programmable counter approach. Assume that the overall division ratio is

$$M = N_1 \times N_2 \qquad (7.22)$$

Here $M$ is the complete divisor needed, $N_1$ is the divisor of the fixed divider, and $N_2$ is the divisor of the programmable divider. The overall resolution is $N_1$. This is because changing counter $N_2$ by 1 results in the overall division ratio changing by $N_1$. In general, we have

$$\mathrm{resolution}_{complete\_divider} = \mathrm{resolution}_{programmable\_divider} \times \mathrm{division\_ratio}_{fixed\_divider} \qquad (7.23)$$


To improve the resolution of the complete divider, one approach is to make the fixed divider also programmable. Since full programmability consumes too much power, we would like to restrict its programmability, which is done by restricting the number of moduli it can divide by. The simplest case is to make it a dual modulus counter. A dual modulus counter can further be viewed as a fixed counter with a pulse swallower, which has the flexibility of counting one more pulse before generating an output or, equivalently, inserting one more pulse before generating an output. This concept is illustrated by an example as shown in Figure 7.14b through Figure 7.14e. First, back in Figure 7.14a we pick $N_{2max} = 4$. This will be used in Figure 7.14b through Figure 7.14e. In Figure 7.14b, we set $N_1 = 3$, $N_2 = 4$ and $f_{divide} = f_o/12$.

Next, we want to switch to another frequency, say, by making $f_{divide} = f_o/11$. How do we achieve this? The simplest way is if $M = N_1 \times N_2$ is programmable by 1 and we change $M$ from 12 to 11. This is not possible here. Since the only programmable counter is the second counter, let us first decrease $N_2$ by 1 with $N_1$ remaining at 3. The result is shown in Figure 7.14c, where there are only nine $f_o$ edges for 1 $f_{divide}$ edge and so $f_{divide} = f_o/9$. Next, to make up for the lost edges, we modify the first counter so that it can swallow pulses. The first counter is now called a pulse swallower or edge inserter. It works by swallowing one extra pulse (or inserting one extra edge) before we generate the output. Hence if we turn on the pulse swallower (edge inserter) function of the first counter, the results will be as shown in Figure 7.14d. Here the first counter, in the first cycle, counts four rather than three $f_o$ edges before it generates an $f_{intermediate}$ edge. Let us assume that in the subsequent cycles, we do not instruct the pulse swallower (edge inserter) to swallow a pulse (or insert an edge) and therefore it divides by 3. An equivalent way of looking at it is that the first counter has two possible moduli (divide by 4 and divide by 3), so it is a programmable counter, but unlike an ordinary programmable counter, its division ratio changes on the fly. Now we have made $f_{divide} = f_o/10$. To get $f_{divide} = f_o/11$, we need to insert some more edges. Note that we have some restrictions: that is, the first counter can insert one and only one edge (not two edges) per $f_{intermediate}$ cycle. Hence to get to $f_o/11$, we insert another edge in the next $f_{intermediate}$ cycle, as shown in Figure 7.14e, and $f_{divide} = f_o/11$. We have finally achieved resolution = 1 since $f_{divide}$ can assume values of $f_o/10$, $f_o/11$, or all the possible values between $f_o/9$ and $f_o/12$. We achieve this level of resolution by programming $N_1$ (telling it which modulus to use) and also how many times it uses that particular modulus (how many times it inserts edges).

Fig. 7.14b Waveform of the complete divider, $N_1 = 3$, $N_2 = 4$

Fig. 7.14c Waveform of the complete divider, $N_1 = 3$, $N_2 = 3$

Fig. 7.14d Waveform of the complete divider, with the fixed divider replaced by a dual modulus divider. $N_1 = 3/4$, $N_2 = 3$. Here $N_1 = 4$ for 1 $N_2$ cycle

Fig. 7.14e Waveform of the complete divider, $N_1 = 3/4$, $N_2 = 3$. Here $N_1 = 4$ for two $N_2$ cycles

Fig. 7.14f Waveform of the complete divider, $N_1 = 4/5$, $N_2 = 3$. Here $N_1 = 4$ for three $N_2$ cycles

Finally, note that we have an overall restriction in choosing $N_{2max}$ and $N_1$. The restriction is

$$N_{2max} > N_1 \qquad (7.24)$$


Let us violate this inequality deliberately and see what happens. For Figure 7.14f through Figure 7.14h, let us go back to the divider in Figure 7.14a and pick $N_{2max} = 3$. Initially, in Figure 7.14f, we set $N_1 = 4$, $N_2 = N_{2max} = 3$, and so $f_{divide}$ is still $f_o/12$. To get to $f_o/11$, we first decrease $N_2$ to 2 with $N_1$ remaining at 4. As shown in Figure 7.14g, $f_{divide} = f_o/8$. Next, in Figure 7.14h, we try to make up for the lost edges by inserting extra $f_o$ edges in all the $f_{intermediate}$ cycles and the maximum we can get is only 10 edges (i.e., $f_{divide} = f_o/10$). Since we have the restriction that only 1 $f_o$ edge can be added per $f_{intermediate}$ cycle, we have run out of $f_{intermediate}$ cycles to add any more edges and $f_{divide} = f_o/11$ cannot be generated. This is the consequence of violating (7.24).

Fig. 7.14g Waveform of the complete divider, $N_1 = 4/5$, $N_2 = 2$. Here $N_1 = 4$ for two $N_2$ cycles

Fig. 7.14h Waveform of the complete divider, $N_1 = 4/5$, $N_2 = 2$. Here $N_1 = 5$ for two $N_2$ cycles
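The two examples above can be reproduced by enumerating which overall ratios a dual modulus arrangement can reach. The sketch below assumes the model described in the text: in each output cycle, the first counter uses modulus $N_1 + 1$ in $s$ of the $N_2$ intermediate cycles (at most one extra edge per intermediate cycle) and modulus $N_1$ in the rest. The function names are illustrative, and only the top step of the range (the one the examples walk through) is checked.

```python
def reachable_ratios(n1, n2_max):
    """Overall division ratios for an n1/(n1+1) dual modulus counter followed
    by a programmable counter with ratio n2 <= n2_max. In each output cycle,
    s of the n2 intermediate cycles use modulus n1 + 1 (0 <= s <= n2)."""
    ratios = set()
    for n2 in range(1, n2_max + 1):
        for s in range(0, n2 + 1):
            ratios.add(n1 * n2 + s)
    return ratios

def missing_in_top_step(n1, n2_max):
    """Ratios between n1*(n2_max - 1) and n1*n2_max that cannot be produced."""
    have = reachable_ratios(n1, n2_max)
    return [m for m in range(n1 * (n2_max - 1), n1 * n2_max + 1) if m not in have]

print(missing_in_top_step(3, 4))   # N1 = 3, N2max = 4: ratios 9 to 12
print(missing_in_top_step(4, 3))   # N1 = 4, N2max = 3: ratios 8 to 12
```

In the second case the ratio 11 is absent, matching the conclusion that $f_o/11$ cannot be generated once (7.24) is violated.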


7.4.2    Example  of  a  Complete  Divider  (DECT  Application) 

Let us now apply this dual modulus counter concept to our DECT example [20]. To do this we start from Figure 7.14a, replace the fixed counter $N_1$ by a dual modulus counter $N_1$ and keep the second-stage programmable counter $N_2$. The resulting divider is shown in Figure 7.15 [19]. For a DECT standard, the division ratio is

$$M = 1089, \ldots, 1098 \qquad (7.25)$$


First we need to decide on $f_{intermediate}$. Now $f_{intermediate}$ can be chosen subject to power consumption and the inequality (7.24). One choice involves making

$$N_2 \approx 16 N_1 \qquad (7.26)$$
Fig. 7.15 A complete divider for DECT application


Substituting (7.26) and (7.25) into (7.22), we see that we can satisfy these equations by assigning $N_1 = 8$, $N_2 = 136$. Hence the divider has a dual modulus divide by 8/9 first stage counter followed by a programmable divide by 136 second stage. The $f_{intermediate}$ becomes $f_{\div 8/\div 9}$. Note that for DECT, there are only 10 channels, so the second divider, strictly speaking, does not need to be fully programmable. We will leave it as a fully programmable divider so that the divider architecture presented can be applied in all general cases.

Let us examine how the $N_1$ counter shown in Figure 7.15 is implemented. This is shown in Figure 7.16 [19]. Figure 7.16a shows the state table of the ÷8/9 counter whose implementation is shown in Figure 7.16b. Let us see how it works. From Figure 7.16a notice that when the swallow control signal (SC) = 1, there are two counts of the 0 state. Normally, as far as the output of the counter is concerned, it expects one output pulse for every eight input pulses. This time it is still getting the same one output pulse without knowing that the input has counted one extra pulse. This is the principle of pulse swallowing.

Let us look at Figure 7.17 for another example of a dual modulus counter (this time it is a ÷4/÷5 counter) [21]. Figure 7.17a shows the state diagram and Figure 7.17b shows the implementation. From Figure 7.17a, the state diagram is more interesting than the state table of Figure 7.16a. Here the control signal is the SC signal (labeled along the paths), where an X means "don't care". Notice that there are two possible state paths: A and B, each consisting of two sequences, a ÷4 and a ÷5 sequence. Assuming the notation of $d_2$, $d_1$, $d_0$, then in path A, the ÷5 sequence is from 000, 001, 011, 010, 100, then back to 000. The ÷4 sequence is 000, 001, 011, 010, 000. For path B, the ÷5 sequence is 000, 001, 011, 110, 100, 000 and the ÷4 sequence is 000, 001, 011, 110, 000. Notice the SC signal decides not only if a pulse is swallowed but also whether path A or B is followed. If SC is enabled at 011, path B will follow, and vice versa. Why is this being done? The answer lies in the implementation, as shown in Figure 7.17b. Here we can see that less logic is involved in the critical path from SC to $f_{intermediate}$, when compared with Figure 7.16b, making the divider faster.

Fig. 7.16a State table of a ÷8/÷9 dual modulus counter (dual modulus counter whose state table has one path). Table annotations: "only 1 pulse for '0' state"; "divide by 9, SC = 1: 2 pulses for '0' states, pulse swallow"

Fig. 7.16b Logic implementation of a ÷8/÷9 dual modulus counter (dual modulus counter whose state table has one path)

Fig. 7.17a State diagram of a ÷4/÷5 dual modulus counter (dual modulus counter whose state diagram has two paths)

Fig. 7.17b Logic implementation of a ÷4/÷5 dual modulus counter (dual modulus counter whose state diagram has two paths)

Next let us go back to Figure 7.15 and finish our discussion on how the complete divider works. We have explained in Figure 7.16 how the ÷8/9 dual modulus counter operates. According to Figure 7.14d, in a dual modulus divider a mechanism is needed to control the number of $N_2$ cycles in which the counter $N_1$ inserts edges (swallows pulses). This is implemented by the blocks labelled "swallow cycle counter" and "comparator" in Figure 7.15. The swallow cycle counter is preloaded with the number of swallow cycles [in Figure 7.14d this should be 1]. It then counts down, starting from this number. During the countdown its output is nonzero and hence the comparator output (= SC) is 1. Therefore, the dual modulus divider swallows pulses. When the swallow cycle counter counts to zero, the comparator output (= SC) changes from 1 to 0. Then the dual modulus divider will no longer swallow pulses [19].

We are now ready to go through the operation of the complete divider in Figure 7.15. We refer to the timing diagram in Figure 7.18 and start by going through one cycle of the normal operation (i.e., no swallowing). Hence initially $f_o$ passes through the pulse swallower unchanged. It is divided by 8 to generate $f_{\div 8/\div 9}$. The swallow cycle counter is a down counter that counts from its preloaded value "number of swallow cycles" down to 0 and stays there until the next time the counter is preloaded with a nonzero value. Accordingly, initially the swallow cycle counter output is stuck at zero and the comparator output is 0. This means that SC is zero, as expected. This stays so until the next interesting event comes along. This happens when the counter is being preloaded with a value "number of swallow cycles", set in the present example to be 2. (Incidentally, preloading always happens at the beginning of the $f_{\div 8 \div 136}$ cycle, denoted to be the grand cycle.) Preloading occurs at the rising edge of $f_{\div 8 \div 136}$, as shown. At that point the preload enable is activated, and a value of 2 is loaded into the swallow cycle counter. This counter output is immediately set to 2, firing up the comparator and enabling SC. As a result, when the next $f_o$ edge comes along, the pulse is swallowed, as indicated by the dotted pulse in the $f_{ps}$ trace. Therefore, the ÷8/9 counter counts one more pulse and its output, $f_{\div 8/\div 9}$, toggles in nine (as opposed to eight) $f_o$ cycles. When the ÷8/9 counter output toggles, it clocks the swallow cycle counter, which then decreases its output by 1, from 2 to 1. The comparator still has a positive output, SC is still 1, the pulse swallower is still activated, and the same thing repeats itself. Therefore, the next $f_{\div 8/\div 9}$ cycle again contains nine $f_o$ cycles. At the end of the next $f_{\div 8/\div 9}$ cycle, however, the swallow cycle counter output counts down from 1 to 0. This sets the comparator output to 0. As a result, SC is 0 and the pulse swallower is disabled. Since the swallow cycle counter is a down counter, once its output reaches 0, it stays there. Hence from now on one $f_{\div 8/\div 9}$ cycle reverts to containing eight $f_o$ cycles. This continues until the counter is reloaded with 2 again, which happens 134 $f_{\div 8}$ cycles later, by the output edge of the $N_2$ counter.
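The sequence just described can be mimicked with a short behavioral model that works at the level of intermediate cycles rather than individual gates. It is a sketch under the assumptions stated in the text (preload at the start of the grand cycle, SC = 1 while the swallow cycle counter is nonzero, one decrement per first-stage output edge); the function name and return format are illustrative.

```python
def simulate_grand_cycle(n1=8, n2=136, swallow_cycles=2):
    """Count f_o edges in one grand cycle (one output period of the N2 counter).

    The swallow cycle counter is preloaded at the start of the grand cycle;
    while it is nonzero SC = 1 and the first stage divides by n1 + 1.
    """
    if not 0 <= swallow_cycles <= n2:
        raise ValueError("swallow_cycles must lie between 0 and n2")
    swallow_counter = swallow_cycles      # preload
    fo_edges = 0
    cycle_lengths = []
    for _ in range(n2):                   # n2 intermediate cycles per grand cycle
        sc = 1 if swallow_counter > 0 else 0
        modulus = n1 + 1 if sc else n1
        fo_edges += modulus
        cycle_lengths.append(modulus)
        if swallow_counter > 0:           # clocked by the first-stage output
            swallow_counter -= 1
    return fo_edges, cycle_lengths

total, lengths = simulate_grand_cycle()
print(total, lengths[:4], total / 136)
```

With two swallow cycles the first two intermediate cycles contain nine $f_o$ edges and the remaining 134 contain eight.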

To show that the complete divider can actually achieve a resolution of 1, let us look at the number of input pulses in an $f_{\div 8 \div 136}$ cycle, which can be calculated as follows [19]:

Number of input pulses to the $N_1$ divider in an $f_{\div 8 \div 136}$ cycle = total $f_o$ pulses in an $f_{\div 8 \div 136}$ cycle $= 2 \times 9 + 134 \times 8$ (7.27)

Number of output pulses from the $N_1$ divider in an $f_{\div 8 \div 136}$ cycle = total $f_{\div 8/\div 9}$ pulses in an $f_{\div 8 \div 136}$ cycle $= 136$ (7.28)

$$\text{Division ratio of } N_1 = \frac{\text{number of input pulses}}{\text{number of output pulses}} = \frac{\text{number of } f_o \text{ pulses}}{\text{number of } f_{\div 8/\div 9} \text{ pulses}} = \frac{2 \times 9 + 134 \times 8}{136} = 8.0147 \qquad (7.29)$$

Therefore, we achieve a fractional division of 0.0147.

What is the smallest fraction we can achieve? This is obviously achieved when the number of swallow cycles = 1. Then

$$\text{division ratio of } N_1 = \frac{1 \times 9 + 135 \times 8}{136} = 8.00735$$

Hence the smallest fractional part is 0.00735 and the resolution of $N_1$ is 0.00735. Accordingly, we have

$$\text{Resolution}_{\text{complete divider}} = 0.00735 \times 136 = 1 \tag{7.30}$$

To go through some more examples, let us set "number of swallow cycles" = 0; we observe that the division ratio of $N_1$ becomes 8. On the other hand, if "number of swallow cycles" = 135, it becomes 8.99265, and if "number of swallow cycles" = 136 (the maximum number of swallow cycles allowable), it becomes 9. So we can see that the division ratio of $N_1$ can be programmed in 0.00735 increments, spanning the entire interval from 8 to 9. In other words, the complete division ratio M varies from 8 × 136 to 9 × 136, or 1088 to 1224. This spans the division ratio used for DECT (1088-1098). What happens if the modulus of counter $N_2$ changes? In that case the division ratio spans the rest of the range (8 through 1088), which is of no interest to DECT.
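The pulse counting in (7.27) through (7.30) can be checked with a few lines of Python. This is only an illustrative sketch: the function names and default arguments are assumptions made for this example, with the ÷8/9 prescaler and the 136-cycle grand cycle taken from the text.

```python
def complete_division_ratio(swallow_cycles, n2=136, low=8, high=9):
    """Total f_o pulses in one grand cycle (the complete division ratio M)."""
    if not 0 <= swallow_cycles <= n2:
        raise ValueError("swallow_cycles must lie between 0 and n2")
    # 'swallow_cycles' prescaler cycles contain 'high' input pulses,
    # the remaining (n2 - swallow_cycles) cycles contain 'low' input pulses.
    return swallow_cycles * high + (n2 - swallow_cycles) * low


def n1_division_ratio(swallow_cycles, n2=136, low=8, high=9):
    """Average division ratio of the N1 divider over one grand cycle."""
    return complete_division_ratio(swallow_cycles, n2, low, high) / n2


if __name__ == "__main__":
    for k in (0, 1, 2, 135, 136):
        print(k, complete_division_ratio(k), round(n1_division_ratio(k), 5))
```

Each extra swallow cycle adds exactly one input pulse to the grand cycle, which is why the complete divider has a resolution of 1 even though $N_1$ alone moves in steps of 1/136.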


7.5     VCO:  Introduction 
7.5.1     Categorization 

VCOs can be categorized by method of oscillation into resonator-based oscillators and waveform-based oscillators, as shown in Figure 7.19. Primary examples of each category are the LC oscillator and the ring oscillator, respectively. Each type has a different way of tuning the frequency [which is described mathematically by (7.1)]: current steering for ring oscillators and a variable capacitor (or varactor) for LC oscillators. Because of the need for integrability, the ring oscillator is very desirable in a VLSI environment, and most of our discussion focuses on this type of oscillator. The relaxation VCO, another waveform-based VCO, is usually not a good choice for the present application because of the large amount of phase noise introduced as a result of positive feedback. A more thorough organization and description of some of these structures follows.

Fig. 7.19 Categorization of VCOs: oscillators divide into resonator oscillators (LC oscillator, crystal oscillator) and waveform oscillators (relaxation oscillator, ring oscillator)

We start by focusing our attention on resonator-based oscillators. First, we review the underlying principles used to analyze the operation of a resonator-based oscillator. There are two approaches to the analysis of resonator-based oscillators:

1. Feedback approach

2. Negative resistance approach

The advantage of adopting the feedback approach is that it makes use of negative feedback amplifier theory, which is familiar to most circuit designers. Hence we adopt this approach. In Chapter 3 we used feedback theory to analyze the distortion, frequency response, and other behaviour of an amplifier, but in the context that the feedback action is negative. Since an oscillator works under the principle of positive feedback, we review feedback theory and see how it applies to the present situation.

As a side note, we can apply small-signal analysis to resonator-based oscillators. Accordingly, these oscillators can be described by a linear time invariant (LTI) system using an S-domain representation. Thus feedback theory for LTI circuits applies. On the other hand, for waveform-based oscillators, small-signal analysis is not applicable. Hence the feedback theory reviewed in subsection 7.5.2 is applicable only to resonator-based oscillators.


7.5.2    Review  of  Positive  Feedback  Theory 

We first review positive feedback theory for an LTI circuit [23]. Let us represent a feedback circuit in the S-domain as shown in Figure 7.20.

Fig. 7.20 Block diagram of a feedback circuit (input $S_i(s)$, feedback signal $S_{fb}(s)$, forward gain $a(s)$, feedback factor $f(s)$, output $S_o(s)$)

Here

$$\frac{S_o}{S_i} = \frac{a(s)}{1 + a(s)f(s)} \tag{7.31}$$


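As an example, let us assume that

$$a(s) = \frac{a}{(1 + s\tau)^3} \tag{7.32}$$

where the constant $a$ in the numerator is the low-frequency gain. We further assume that $f(s)$ is negative (so overall feedback is positive), resistive, and constant. Let us now close the feedback loop. Then

$$\frac{S_o}{S_i} = \frac{a(s)}{1 + a(s)f(s)} = \frac{k}{\left(1 - \frac{s}{s_1}\right)\left(1 - \frac{s}{s_2}\right)\left(1 - \frac{s}{s_3}\right)} \tag{7.33}$$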


We see that the transfer function has three poles, which are the roots of the equation

$$1 + a(s)f(s) = 0 \tag{7.34a}$$

$$1 + \frac{af}{(1 + s\tau)^3} = 0 \tag{7.34b}$$

$$(1 + s\tau)^3 = -af \tag{7.34c}$$

$$1 + s\tau = \sqrt[3]{-af} \tag{7.34d}$$

To take care of the -1 inside the cube root we observe that

$$-1 = 1\angle(180^\circ + n\,360^\circ), \quad n = 1, 2, 3 \tag{7.35a}$$

$$\sqrt[3]{-1} = 1\angle(60^\circ + n\,120^\circ) \tag{7.35b}$$

$$= -1,\ 1\angle 60^\circ,\ 1\angle{-60^\circ} \tag{7.35c}$$

Fig. 7.21 Root locus of the example feedback circuit

Fig. 7.22 Pole/zero diagram of the example feedback circuit


Substituting the cube root of -1 and normalizing $\tau = 1$, we find the poles of the transfer function as

$$s_1 = -1 - \sqrt[3]{af}$$

$$s_2 = -1 + \sqrt[3]{af}\,\angle 60^\circ$$

$$s_3 = -1 + \sqrt[3]{af}\,\angle{-60^\circ} \tag{7.36}$$

where $af$ is the low-frequency loop gain.

Let us plot the root locus as a function of $af$ in Figure 7.21.

Notice that when $af = 8$, two of the poles have just reached the imaginary axis. For $af > 8$ we have right half plane (RHP) poles, shown in Figure 7.22. Eventually oscillation reaches steady state. Notice that our analysis so far only shows that oscillation will start. It needs nonlinear analysis, which is outside the scope of this book, to show that a steady state is reached.
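The pole locations in (7.36) are easy to tabulate numerically. The short Python sketch below is illustrative only: the function name `closed_loop_poles` is an assumption made for this example, and it simply evaluates the three roots of $(1 + s\tau)^3 = -af$ from (7.34c) for a positive low-frequency loop gain.

```python
import cmath
import math


def closed_loop_poles(af, tau=1.0):
    """Roots of (1 + s*tau)**3 = -af for a positive low-frequency loop gain af."""
    radius = af ** (1.0 / 3.0)  # magnitude of the cube root of af
    poles = []
    for angle_deg in (180.0, 60.0, -60.0):  # the three cube roots of -1, as in (7.35c)
        root = radius * cmath.exp(1j * math.radians(angle_deg))
        poles.append((root - 1.0) / tau)
    return poles


if __name__ == "__main__":
    for af in (1.0, 4.0, 8.0, 27.0):
        print(af, [complex(round(p.real, 4), round(p.imag, 4)) for p in closed_loop_poles(af)])
```

For $af = 8$ the complex pair lands on the imaginary axis at $\pm j\sqrt{3}$; for larger $af$ the real part of the pair becomes positive.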

To get a more complete picture of the behaviour of this transfer function, let us draw the Nyquist plot, which is a plot of $af(j\omega)$ in magnitude and phase on a polar plot using $\omega$ as the variable. In the present case, $af(j\omega) = \dfrac{af}{(1 + j\omega\tau)^3}$, with $\tau = 1$.

As a check, as $\omega \to \infty$, $(1 + j\omega)^3 \to (j\omega)^3$, so the angle of $af(j\omega)$ approaches -270°, which agrees with the plot. As a condition of oscillation, we are interested in the frequency where $\angle af(j\omega) = -180^\circ$.

Let us try

$$\omega = \sqrt{3}, \quad \tau = 1 \tag{7.36a}$$

Upon substitution, we get

$$af(j\omega) = -\frac{af}{8} \tag{7.36b}$$

Hence we can make two observations:

1. Feedback in this circuit is negative at $\omega = 0$.

2. Feedback is positive at $\omega = \sqrt{3}$.

To oscillate, we need one more condition:

$$\text{loop gain} \geq 1 \tag{7.37}$$

We now derive what this condition translates into. Mathematically,

$$\frac{S_o}{S_i}(j\omega) = \frac{a(j\omega)}{1 + af(j\omega)}$$

Now we put $\omega = \sqrt{3}$, for which

$$af(j\omega) = -\frac{af}{8}$$

To set loop gain = 1, we put $af = 8$. The transfer function with feedback applied is

$$\frac{S_o}{S_i}(j\omega) = \frac{a(j\omega)}{1 + af(j\omega)} = \frac{a(j\omega)}{1 - 1} \tag{7.38}$$

This indicates oscillation, and hence a low-frequency loop gain of 8 causes oscillation.
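The substitution leading to (7.36b) and (7.38) can be confirmed numerically. The following Python sketch is illustrative; the helper name `loop_gain` is an assumption made for this example and it evaluates $af/(1 + j\omega\tau)^3$ directly.

```python
import cmath
import math


def loop_gain(omega, af, tau=1.0):
    """Loop gain af(j*omega) of the three-pole example, af/(1 + j*omega*tau)**3."""
    return af / (1.0 + 1j * omega * tau) ** 3


if __name__ == "__main__":
    w = math.sqrt(3.0)
    for af in (4.0, 8.0, 16.0):
        t = loop_gain(w, af)
        print(af, round(abs(t), 4), round(math.degrees(cmath.phase(t)), 1))
```

At $\omega = \sqrt{3}$ the loop gain is real and negative for any $af$, and its magnitude reaches 1 exactly when $af = 8$, which makes the denominator of (7.38) vanish.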

As a check, referring to Figure 7.23, oscillation should occur when the Nyquist plot passes through (-1, 0), and this indeed happens at $af = 8$. As a final check, referring to Figure 7.21, S-plane analysis tells us that oscillation should occur when the root locus passes through the imaginary axis, and the plot shows that this does happen at $af = 8$.

Fig. 7.23 Nyquist plot of the example feedback circuit. This diagram scales directly and linearly with the low frequency loop gain ($af$)

Fig. 7.24 Frequency response of the example feedback circuit

One natural question to ask is, what happens if loop gain > 8? It turns out that this is not obvious from the Nyquist plot. Mathematically, the number of times the Nyquist plot encircles (-1, 0) equals the number of RHP poles, but the plot does not give us any more information. To answer that question, let us plot the overall gain, $\left|\dfrac{S_o}{S_i}(j\omega)\right|$, as $af$ increases in Figure 7.24. As $af \to 8$, the gain goes to $\infty$. This indicates the presence of RHP poles.

The complete response of a circuit with RHP poles to any input is

$$S_o = f(\text{input}) + K_1 e^{-\sigma_1 t} + K_2 e^{\sigma_2 t}\cos\omega_0 t \tag{7.39}$$

Here $f(\text{input})$ is the forced response (amplifier response), $K_1 e^{-\sigma_1 t}$ is the decaying exponential natural response, and $K_2 e^{\sigma_2 t}\cos\omega_0 t$ is the oscillation, which is shown in Figure 7.25.

Notice that in Figure 7.25, linear analysis is valid in the first part of the waveform, and hence the zero crossing frequency equals $\omega_0$ and is set by the RHP pole location. In the second part of the waveform the effect of nonlinearity in the active devices eventually damps the growing exponential so that it has a steady-state waveform with constant amplitude. Here the zero crossing frequency is not necessarily $\omega_0$. Since we want the frequency of oscillation to be stable, this circuit is not often used for oscillation.

Fig. 7.25 Transient response of the example feedback circuit

To summarize, for oscillation we want positive feedback, so the phase shift of the loop gain is 180° and the magnitude of the loop gain is equal to or greater than 1, while maintaining a stable and predictable frequency of oscillation. To do this we must establish accurately the frequency where the total loop phase shift is 0°. This is key to all oscillator designs. Our first attempt toward maintaining a stable frequency of oscillation involves the use of an LC tank in an oscillator. This oscillator further uses a transformer to do phase inversion, which is necessary for positive feedback. This architecture has the advantage that one can accurately define the frequency of zero loop phase shift.


7.6    LC  Oscillators 

7.6.1     Basic  LC  Oscillators 

A resonator-based oscillator that uses an LC tank as a resonator is shown in Figure 7.26. The transformer is used to do phase inversion, making $f(s)$ in (7.31) negative and thus achieving positive feedback. Transistor $M_1$ is, of course, used to implement $a(s)$ in (7.31). Here $C_1$, $C_2$ are large bypass capacitors and $R_1$, $R_2$, $R_E$ set the bias. L, C resonate at

$$\omega_0 = \frac{1}{\sqrt{LC}} \tag{7.40}$$

In (7.40), C is not simply $C_L$, as the transformer reflects impedance from the input.

Fig. 7.26 Transformer based LC oscillator


A stable and well-specified frequency in this circuit is $\omega_0$, the LC resonance frequency. We will try to make this the frequency of oscillation. Hence we want 360° phase shift at

$$\omega_0 = \frac{1}{\sqrt{LC}} \tag{7.41}$$

and

$$af \geq 1 \ \text{at} \ \omega_0 \tag{7.42}$$

for oscillation.

To analyze, let us eliminate the bypass capacitors and bias resistors from Figure 7.26 and redraw Figure 7.26 in Figure 7.27. $V_{dd}$ becomes ac ground as well. At resonance, the impedance of an LC tank circuit becomes infinite. The transformer's action is represented by setting $v_i = v_o/n$, where n is the turn ratio. Hence at resonance Figure 7.27 becomes Figure 7.28.

From Figure 7.28, loop gain $af$ at $\omega_0$ is

$$\text{Loop gain} = \frac{g_m R}{n} \tag{7.43}$$

In (7.43), $g_m$ is the transconductance of transistor $M_1$, but R is not simply $R_L$, as the transformer reflects impedance from the input.

To find R, let us form the small-signal equivalent of the complete ac circuit described in Figure 7.27 and analyze it using Laplace transforms. This small-signal circuit is shown in Figure 7.29, where

$$f = \frac{1}{n} \tag{7.44}$$

$$a(s) = -g_m Z_T(s) \tag{7.45}$$

Fig. 7.27 Transformer-based LC oscillator with biasing removed

Fig. 7.28 Transformer-based LC oscillator at resonance ($v_i = v_o/n$)

Fig. 7.29 Small-signal circuit of transformer-based LC oscillator

Fig. 7.30 Total impedance $Z_T$, including reflected impedance (notice that R includes the reflected input resistance)

$Z_T$ is described in Figure 7.30. $R_i$ and $C_i$ in Figure 7.29 are the input resistance and capacitance of $M_1$. $R_i$ and $C_i$ are reflected through the transformer to affect R and C in Figure 7.30. We can refer to Figure 7.29 to find R, C:

$$C = C_L + \frac{C_i}{n^2}, \qquad R = R_o \,\|\, R_L \,\|\, n^2 R_i \tag{7.46}$$


Next we substitute (7.46) into (7.45) to get $a(s)$. The resulting expression and (7.44) are then multiplied together to get

$$af(s) = -\frac{g_m R}{n}\,\frac{\frac{L}{R}s}{1 + \frac{L}{R}s + LCs^2} \tag{7.47}$$


To start oscillation, we insert a small noise source $V_x$ as shown in Figure 7.29. This has the transfer function

$$\frac{V_o}{V_x} = \frac{a(s)}{1 + af(s)} \tag{7.48}$$


Substituting (7.45) and (7.47) in (7.48), we get

$$\frac{V_o}{V_x} = \frac{-g_m R\,\dfrac{\frac{L}{R}s}{1 + \frac{L}{R}s + LCs^2}}{1 - \dfrac{g_m R}{n}\,\dfrac{\frac{L}{R}s}{1 + \frac{L}{R}s + LCs^2}} \tag{7.49a}$$

$$\frac{V_o}{V_x} = \frac{-g_m R\,\frac{L}{R}s}{1 + \frac{L}{R}s + LCs^2 - \frac{g_m R}{n}\frac{L}{R}s} \tag{7.49b}$$

This transfer function has one zero and two poles and so can be represented as

We now apply feedback by making n a finite number. Specifically, we set

$$\frac{g_m R}{n} = 1 \tag{7.51}$$

Substituting this in (7.49b), we find that the denominator of the transfer function becomes

$$1 + LCs^2 = \left(1 + js\sqrt{LC}\right)\left(1 - js\sqrt{LC}\right) \tag{7.52}$$

This has solutions:

$$s = \frac{j}{\sqrt{LC}}, \qquad s = -\frac{j}{\sqrt{LC}}$$

Hence the poles/zeroes of (7.49b) are as shown in Figure 7.31.

Consequently, if we plot $\left|\dfrac{V_o}{V_x}(\omega)\right|$ as a function of $\omega$, it will peak up at $\omega_0$, as shown in Figure 7.32.

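To plot the root locus, let us revisit the denominator of the transfer function. From (7.49b) we have

$$\text{denominator of transfer function} = 1 + as + bs^2 = b\left(s^2 + \frac{a}{b}s + \frac{1}{b}\right) \tag{7.53}$$

where, comparing with (7.49b), $a = \frac{L}{R}\left(1 - \frac{g_m R}{n}\right)$ and $b = LC$. Setting the denominator to zero, we have the following solutions:

$$s_1, s_2 = \frac{-\frac{a}{b} \pm \sqrt{\left(\frac{a}{b}\right)^2 - \frac{4}{b}}}{2} \tag{7.54}$$

Simplifying, we have

$$s_1, s_2 = -\frac{a}{2b} \pm \frac{a}{2b}\sqrt{1 - \frac{4b}{a^2}} \tag{7.55}$$

If the quality factor (Q) of the RLC tank is high, $s_1$, $s_2$ are complex and (7.55) can be rewritten as

$$s_1, s_2 = -\frac{a}{2b} \pm j\,\frac{a}{2b}\sqrt{\frac{4b}{a^2} - 1} \tag{7.56}$$

Let $s = x \pm jy$; then from (7.56)

$$|s| = \sqrt{x^2 + y^2} = \sqrt{\frac{a^2}{4b^2} + \frac{a^2}{4b^2}\left(\frac{4b}{a^2} - 1\right)} = \frac{1}{\sqrt{b}} = \frac{1}{\sqrt{LC}} = \omega_0 \tag{7.57}$$

Fig. 7.31 Pole/zero diagram of transformer-based LC oscillator

Fig. 7.32 Frequency response of transformer-based LC oscillator (peak at $\omega_0 = 1/\sqrt{LC}$)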


From  (7.57)  we  infer  that  the  poles  always  have  constant  magnitudes.  This  is  true 
for  any  loop  gain  and  so  the  root  locus  is  a  circle.  This  is  shown  in  Figure  7.33. 
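The constant-magnitude property can be checked numerically. The Python sketch below is illustrative only: the function name and the component values are assumptions chosen for this example, and the coefficients a and b are read off the denominator of (7.49b).

```python
import cmath
import math


def lc_oscillator_poles(L, C, R, gm, n):
    """Poles of (7.49b): roots of 1 + a*s + b*s**2 = 0."""
    a = (L / R) * (1.0 - gm * R / n)  # damping term, reduced by the loop gain gm*R/n
    b = L * C
    disc = cmath.sqrt(a * a - 4.0 * b)  # imaginary for a high-Q tank
    return (-a + disc) / (2.0 * b), (-a - disc) / (2.0 * b)


if __name__ == "__main__":
    L, C, R, n = 10e-9, 1e-12, 1000.0, 2.0  # assumed example values
    w0 = 1.0 / math.sqrt(L * C)
    for gm in (0.0, 1e-3, 2e-3, 3e-3):  # loop gain gm*R/n = 0, 0.5, 1, 1.5
        s1, _ = lc_oscillator_poles(L, C, R, gm, n)
        print(gm * R / n, s1.real / w0, abs(s1) / w0)
```

As the loop gain $g_m R/n$ increases, the real part of the poles moves from negative through zero (at a loop gain of 1) to positive, while $|s|/\omega_0$ stays at 1, so the poles travel along a circle of radius $\omega_0$.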


7.6.2    Alternate  LC  Oscillator  Topologies 


7.6.2.1     Common  gate  oscillator 

The oscillator in Figure 7.26 can be redrawn in a common gate configuration, as shown in Figure 7.34. Since the transformer now provides no phase shift, we do not need a transformer and can use capacitors instead (a capacitive transformer). The result is the Colpitts oscillator shown in Figure 7.35.

Fig. 7.33 Root locus of transformer-based LC oscillator (positive feedback enhances Q and narrows the bandwidth; negative feedback broadbands the amplifier)

Fig. 7.34 Common gate LC oscillator

Fig. 7.35 Colpitts oscillator

Fig. 7.36 Colpitts oscillator, resonant loop highlighted

7.6.2.2     Colpitts  Oscillator 

Comparing Figure 7.35 with Figure 7.26 we note that the n, $\omega_0$ derived for Figure 7.26 now assume the following form:

$$\frac{1}{n} = \frac{C_1}{C_1 + C_2}, \qquad \omega_0 = \frac{1}{\sqrt{L\,\dfrac{C_1 C_2}{C_1 + C_2}}} \tag{7.58}$$


We now offer some comments on this Colpitts oscillator. First, Figure 7.35 is redrawn in Figure 7.36. We omit $C_L$ for the time being. Notice that the L, $C_1$, $C_2$ loop is the resonant loop. Hence, the sum of impedance around the loop is 0 at $\omega_0$. Therefore, there is a large reactive current flowing around this loop, whose value is given by Q × current in $R_L$.

Now we can analyze Figure 7.36 as follows. From the capacitive divider formula,

$$\frac{V_i}{V_o} = \frac{C_1}{C_1 + C_2} \tag{7.59}$$

From the gain formula,

$$V_o = g_m R_L V_i \tag{7.60}$$

Getting f from (7.59) and a from (7.60) and multiplying them together, we have

$$af = g_m R_L \frac{C_1}{C_1 + C_2} \tag{7.61}$$

Oscillation frequency is

$$\omega_0 = \frac{1}{\sqrt{L\,\dfrac{C_1 C_2}{C_1 + C_2}}} \tag{7.62}$$

Since at resonance the sum of impedance around the loop = 0,

$$\frac{1}{j\omega L} = -j\omega\,\frac{C_1 C_2}{C_1 + C_2} \quad \text{at oscillation frequency} \tag{7.63}$$

Therefore

$$j\omega L + \frac{1}{j\omega C_1} + \frac{1}{j\omega C_2} = 0 \tag{7.64}$$

$$jX_L + jX_1 + jX_2 = 0 \tag{7.65}$$

where

$$X_1 = -\frac{1}{\omega C_1}, \qquad X_2 = -\frac{1}{\omega C_2}$$

or

$$j\omega L = -\frac{1}{j\omega\,\dfrac{C_1 C_2}{C_1 + C_2}} \tag{7.66}$$

Hence we see that at oscillation we create a negative impedance from $C_1$, $C_2$ whose value is equal and opposite to the impedance of L. Accordingly, the condition of oscillation can also be explained using this negative impedance.

Next let us redraw Figure 7.36 in Figure 7.37, where there is a change in the location of ground. Now it becomes the common source (CS) Colpitts oscillator.

Furthermore, let us redraw Figure 7.37 in Figure 7.38. It is now relatively easy to see why this is an oscillator.

First, from Figure 7.38 we can see that the reverse transmission transfer function is given by

$$\frac{V_i}{V_o} = \frac{\dfrac{1}{j\omega C_2}}{j\omega L + \dfrac{1}{j\omega C_2}} = \frac{1}{1 - \omega^2 L C_2} \tag{7.67}$$

At oscillation frequency, $\omega$ becomes

$$\omega_0 = \frac{1}{\sqrt{L\,\dfrac{C_1 C_2}{C_1 + C_2}}} \tag{7.68}$$

Fig. 7.37 Common source Colpitts oscillator

Fig. 7.38 Common source Colpitts oscillator, alternate orientation

Setting $\omega = \omega_0$ in (7.67) and substituting (7.68) in (7.67), we have

$$\frac{V_i}{V_o} = -\frac{C_1}{C_2} \tag{7.69}$$

Hence, at $\omega_0$ there is exactly 180° phase shift in this network, making the total phase shift 360°, or satisfying the condition for oscillation.
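The step from (7.67) and (7.68) to the final ratio can be verified numerically. The Python sketch below is illustrative; the function names and component values are assumptions made for this example.

```python
import cmath
import math


def colpitts_w0(L, c1, c2):
    """Oscillation frequency (7.68): L resonating with C1 in series with C2."""
    return 1.0 / math.sqrt(L * c1 * c2 / (c1 + c2))


def reverse_transmission(omega, L, c2):
    """Reverse transmission V_i/V_o of (7.67): divider formed by L and C2."""
    z_c2 = 1.0 / (1j * omega * c2)
    return z_c2 / (1j * omega * L + z_c2)


if __name__ == "__main__":
    L, c1, c2 = 5e-9, 2e-12, 4e-12  # assumed example values
    ratio = reverse_transmission(colpitts_w0(L, c1, c2), L, c2)
    print(ratio.real, math.degrees(cmath.phase(ratio)))
```

With these values the ratio evaluates to approximately -0.5, that is, $-C_1/C_2$, with a phase of 180°, which is the inversion needed to complement the 180° of the common source stage.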


7.6.3     Tuning,  Kv 


How do we perform tuning? One way is to use so-called varactor tuning. This is achieved by adding a variable capacitor in series with the original capacitor. For example, $C_1$ in Figure 7.38 is replaced by a series combination of $C_1$ and $C_v$, as shown in Figure 7.39. $C_v$ is the capacitance from a varactor.

Fig. 7.39 Tuning circuit for LC oscillator (panels (a) and (b); $V_{tune} = V_{lf}$)
The varactor is basically a reverse biased pn junction diode whose depletion capacitance is a function of voltage. Even though popular, this type of tuning does not have a wide tuning range and is also sensitive to temperature variation. We now quantify the $K_{vco}$ of such an arrangement. We start from (7.1) and rewrite the definition of $K_{vco}$ with units in rad/s:

$$K_{vco} = \frac{d\omega_o}{dV_{lf}} = \frac{d\omega_o}{dV_{tune}} = 2\pi\frac{df_o}{dV_{tune}} \tag{7.70}$$

Whether $K_{vco}$ is in units of deg/s/V or rad/s/V should be evident from the context. Now from pn junction theory we have

$$C_v = \frac{C_{vo}}{\sqrt{1 + \dfrac{V_{tune}}{\phi_0}}} \tag{7.71}$$

Here $C_{vo}$ is the capacitance at zero bias and $V_{tune}$, the tuning voltage, is the input to the VCO. In a PLL this also equals $V_{lf}$, the loop filter voltage. $\phi_0$ is a constant that is around 0.6 V.

Differentiating (7.71), we have

$$\frac{dC_v}{dV_{tune}} = -\frac{C_{vo}}{2\phi_0\left(\sqrt{1 + \dfrac{V_{tune}}{\phi_0}}\right)^3} \tag{7.72}$$


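According to Figure 7.39, $C_1$ in Figure 7.38 becomes:

$$C_1 \rightarrow \frac{C_1 C_v}{C_1 + C_v} \tag{7.73}$$

Substituting (7.73) in (7.68), we have

$$\omega_0 = \frac{1}{\sqrt{L\,\dfrac{C_1 C_2 C_v}{C_1 C_2 + C_1 C_v + C_2 C_v}}} \tag{7.74}$$

Differentiating (7.74), we have

$$\frac{d\omega_0}{dC_v} = \frac{C_1 + C_2}{2\sqrt{(C_1 C_2 + C_1 C_v + C_2 C_v)(C_1 C_2 C_v L)}} - \frac{\sqrt{C_1 C_v + C_2 C_v + C_1 C_2}}{2\sqrt{L C_1 C_2}\left(\sqrt{C_v}\right)^3} \tag{7.75}$$

Finally, expanding (7.70) and substituting (7.75) and (7.72) in the expansion, we have

$$K_{vco} = \frac{d\omega_0}{dV_{tune}} = \frac{d\omega_0}{dC_v}\,\frac{dC_v}{dV_{tune}} = -\frac{C_{vo}}{2\phi_0\left(\sqrt{1 + \dfrac{V_{tune}}{\phi_0}}\right)^3} \times \left(\frac{C_1 + C_2}{2\sqrt{(C_1 C_2 + C_1 C_v + C_2 C_v)(C_1 C_2 C_v L)}} - \frac{\sqrt{C_1 C_v + C_2 C_v + C_1 C_2}}{2\sqrt{L C_1 C_2}\left(\sqrt{C_v}\right)^3}\right) \tag{7.76}$$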

What are the design considerations of $K_{vco}$? $K_{vco}$ in general should be designed to be as small as possible. However, it must be large enough that $f_o$ can span the whole frequency range with a tuning voltage, $V_{lf}$, that is within the power supply. $K_{vco}$ should be made small because the varactor is connected to the LC tank via a small fixed capacitor. If $K_{vco}$ is large, then this coupling capacitor is large and the varactor has a large influence on the resonant frequency of the LC tank. In addition, the varactor itself has a low Q factor, in particular when compared with the inductor or capacitor in the oscillator. This is due to the resistance in the varactor itself (on the order of 1 Ω) and also due to packaging. A large varactor influence (due to a large $K_{vco}$) and a low Q varactor mean that the varactor resistance is translated across to the tank circuit, and this would reduce the Q of the tank significantly.
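A quick numerical cross-check of the chain-rule result for $K_{vco}$ is sketched below in Python. It is a sketch under stated assumptions, not a design tool: the varactor law used is the standard abrupt-junction form $C_v = C_{vo}/\sqrt{1 + V_{tune}/\phi_0}$ with $V_{tune}$ taken as the reverse bias, and all function names and component values are hypothetical choices for this example.

```python
import math


def varactor_capacitance(v_tune, c_vo, phi0=0.6):
    """Assumed abrupt-junction varactor law, v_tune being the reverse bias."""
    return c_vo / math.sqrt(1.0 + v_tune / phi0)


def tuned_w0(c_v, L, c1, c2):
    """Oscillation frequency with C1 replaced by C1 in series with Cv."""
    s = c1 * c2 + c1 * c_v + c2 * c_v
    return math.sqrt(s / (L * c1 * c2 * c_v))


def kvco_analytic(v_tune, L, c1, c2, c_vo, phi0=0.6):
    """K_vco = (dw0/dCv) * (dCv/dVtune), in rad/s/V."""
    c_v = varactor_capacitance(v_tune, c_vo, phi0)
    s = c1 * c2 + c1 * c_v + c2 * c_v
    dw_dcv = ((c1 + c2) / (2.0 * math.sqrt(s * c1 * c2 * c_v * L))
              - math.sqrt(s) / (2.0 * math.sqrt(L * c1 * c2) * c_v ** 1.5))
    dcv_dv = -c_vo / (2.0 * phi0 * (1.0 + v_tune / phi0) ** 1.5)
    return dw_dcv * dcv_dv


def kvco_numeric(v_tune, L, c1, c2, c_vo, phi0=0.6, dv=1e-4):
    """Central-difference estimate of dw0/dVtune for comparison."""
    hi = tuned_w0(varactor_capacitance(v_tune + dv, c_vo, phi0), L, c1, c2)
    lo = tuned_w0(varactor_capacitance(v_tune - dv, c_vo, phi0), L, c1, c2)
    return (hi - lo) / (2.0 * dv)


if __name__ == "__main__":
    args = (1.0, 5e-9, 2e-12, 2e-12, 1e-12)  # V_tune, L, C1, C2, C_vo (assumed)
    print(kvco_analytic(*args), kvco_numeric(*args))
```

Under these assumptions the two estimates agree closely and $K_{vco}$ is positive: a larger reverse bias shrinks $C_v$ and raises the oscillation frequency.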


7.7     Ring  Oscillators 

We now turn our attention to the second oscillator category: waveform-based oscillators. To illustrate its nature we focus our discussion on one specific type: ring oscillators. A ring oscillator is shown in Figure 7.40. Notice that this structure also employs positive feedback to achieve oscillation. However, because switching is involved, it can no longer be treated as an LTI system, and the results developed in subsection 7.5.2 are no longer applicable. We now study this using large-signal time domain analysis.

Fig. 7.40 Block diagram of ring oscillator

Fig. 7.41 (a) Differential implementation of ring oscillator. (b) Corresponding waveform

Let us start with Figure 7.41a, which depicts an N-stage ring oscillator realized using differential cells (which have complementary outputs). A source coupled pair (SCL) inverter is a typical implementation. Assume that at time $t_0$ the output of stage 1 changes to logic 1 (denoted by edge a), as shown in Figure 7.41b. When this logic 1 propagates to the end, it creates a logic 1 at the Nth stage, which, when fed back to the input of the first stage, creates a logic 0 at the first stage output. This is edge b. When this logic 0 is propagated through the chain again, it toggles the output of stage 1 and triggers edge c. Notice that it takes two passes through the chain to complete a period (each pass generates an edge transition, and we need an up transition followed by a down transition to complete a period). Denoting $t_p$ as the propagation delay through each stage, the period is $T = 2Nt_p$. To start an oscillation, N has to be odd for a single-ended output cell, but for a differential cell N can be odd or even. There is, of course, a minimum N. The minimum N depends on the $\omega_0$ and $t_p$ available from a given technology. Assuming that 180° of phase shift is provided by the chain of N stages, each stage must provide 180°/N of phase shift and sufficient gain (so that the overall gain is greater than 1) at $\omega_0$. This usually means that the gain of each individual stage has to be greater than 1 as well.
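The relation $T = 2Nt_p$ and the per-stage phase requirement can be put into a small Python sketch. The function names and the example delay are assumptions made for illustration.

```python
def ring_oscillator_frequency(n_stages, t_p):
    """Oscillation frequency from T = 2*N*t_p (two passes through the chain per period)."""
    return 1.0 / (2.0 * n_stages * t_p)


def phase_per_stage_deg(n_stages):
    """Phase shift each stage must provide if the chain supplies 180 degrees in total."""
    return 180.0 / n_stages


if __name__ == "__main__":
    for n in (3, 4, 5):
        # 100 ps per stage is an assumed example delay
        print(n, ring_oscillator_frequency(n, 100e-12), phase_per_stage_deg(n))
```

For a fixed stage delay, adding stages lowers the frequency and relaxes the phase shift each stage must supply.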
7.7.1     Delay  Cells

There are many features that differentiate the delay cells used in a ring oscillator. The most important, perhaps, is the slew time. Contrary to popular belief, it is actually the slew time (a dynamic parameter), rather than the interstage gain (a static parameter), that determines the overall phase noise performance [4]. Along this line, therefore, we categorize the delay cells of a ring oscillator into three different types [16]. The first one is a fast-slewing saturated delay cell [17]. This delay cell has fast rise and fall times. It also performs full switching and therefore belongs to the saturated class of delay cell. An example that consists of one voltage-based inverter plus a Schmitt trigger and a buffer is shown in Figure 7.42. Here the delay time is determined by the amount of current supplied through the current source, the input capacitance, and the Schmitt trigger threshold. The current supplied is adjusted by the tuning voltage $V_{tune}$. The Schmitt trigger provides the fast rising and falling edges. As shown, each PMOS and NMOS device of the input inverter completely turns off upon switching.

Figure 7.43 shows the output voltage swings of three such adjacent delay cells. In Figure 7.43, the three dotted lines are the inputs $V_{in1} \ldots V_{in3}$ and the three solid lines are the outputs of the input inverters, $V_{inv1} \ldots V_{inv3}$. Let us start from $V_{in1}$ (the first dotted line) and assume that it drops abruptly. The inverter takes time to change. Therefore, $V_{inv1}$ (the first solid line) rises slowly. As soon as it hits the threshold of the Schmitt trigger, $V_{Schmitt1}$ rises abruptly, resulting in $V_{out1}$ rising abruptly. Now this $V_{out1}$ is connected to $V_{in2}$ and so $V_{in2}$ rises abruptly as well (second dotted line). $V_{in2}$ has the same steepness as $V_{in1}$ but is delayed from $V_{in1}$ by a certain period. The process repeats itself, as $V_{inv2}$ (second solid line) responds to $V_{in2}$ and drops slowly. Therefore, $V_{inv2}$ is delayed from $V_{inv1}$ by the same delay as $V_{in2}$ is from $V_{in1}$. Finally, the process is repeated for the third stage, whose input is $V_{in3}$ (third dotted line) and whose inverter output is $V_{inv3}$ (third solid line). In general, we can make the following observations:

1. $V_{inv}$ changes slowly, whereas $V_{in}$ changes abruptly.

2. When we go from one stage to the other, $V_{in}$ and $V_{inv}$ change in the opposite direction.

3. $V_{inv}$ starts to change when $V_{out}$ from the previous stage hits the voltage rail (either $V_{dd}$ or ground).

4. $V_{Schmitt}$ and hence $V_{out}$ (and the next stage $V_{in}$) start to change when $V_{inv}$ crosses $V_{threshold}$, the threshold voltage of the Schmitt trigger. Usually the Schmitt trigger has two thresholds. However, to simplify the present explanation, only one threshold is assumed.

The present design is a fast-slewing delay cell because of the rapid switching of $V_{out}$, made possible by the use of the Schmitt trigger. Another variation of this type of delay cell consists of only one inverter without a buffer.

The second type of delay cell is a slow-slewing saturated delay cell. An example of this type is shown in Figure 7.44. Here the inverter consists of a source coupled pair (SCP) and hence this is a current-based inverter. In this case, full switching also occurs. Therefore, this delay cell belongs to the saturated class of delay cell, just like the one described in Figure 7.42. On the other hand, it is called slow slewing because it has a longer gate delay. Because of the square law drain current characteristics, the delay is inversely proportional to the voltage swing. For a low supply voltage $V_{dd}$, the voltage swing is typically less than 1 V, making the delay much longer than that of the fast-slewing cell of Figure 7.42. Another difference from the fast-slewing cell is that here the load consists of resistors realized by biasing a transistor into the triode region. Because this resistance is a function of drain current (which is not true for a real physical resistor), the RC time constant and hence $f_o$ can be tuned by varying $I_{tune}$. This is discussed further later.

Fig. 7.44 Slow-slewing saturated delay cell

Figure 7.45 shows the output voltage swings of three such adjacent delay cells. The three dotted lines correspond to the three outputs (= inputs of the subsequent stages) $V_{out0} \ldots V_{out2}$. Compared with the dotted lines of Figure 7.43 (which correspond to the outputs of the fast-slewing cells), notice that because of the extra delay and the lack of a Schmitt trigger, the delay is longer and the rise/fall time is longer. Nonetheless, this delay cell is still saturated because the output goes all the way between $V_{dd}$ and ground, which means that the devices are completely turned on/off.

The third type of delay cell is the non-saturated delay cell [18], as shown in Figure 7.46. This is also a voltage inverter based delay cell. Compared with Figure 7.42, it does not have a Schmitt trigger to enable fast turning on/off. Compared with both Figure 7.42 and Figure 7.44, the transistors $M_1$, $M_2$, $M_3$, $M_4$, $M_5$, $M_6$ never fully turn on/off. As a result, as shown in Figure 7.47, the output waveforms $V_{out0} \ldots V_{out2}$ never reach $V_{dd}$ or ground, which is why this type of delay cell is called non-saturated.

Fig. 7.47 Waveform of 3 cascaded non-saturated delay cells

7.7.2     Tuning

Now that we have discussed all three types of delay cells, we discuss how to
calculate f0 and Kvco. First, f0 is given by

$$f_0 = \frac{1}{2 N t_p} \tag{7.77}$$

where N = number of stages and tp = delay of each stage.
Next we want to calculate Kvco. First let us repeat its definition from (7.70):

$$K_{VCO} = 2\pi \frac{df_0}{dV_{tune}} \tag{7.78}$$

At  this  point  we  would  like  to  differentiate  ring  oscillators  into  two  classes, 
namely  one  whose  delay  cell  uses  an  active  load  (e.g.,  the  delay  cell  in  Figure  7.42) 
and  one  whose  delay  cell  uses  a  resistive  load  (e.g.,  the  delay  cell  in  Figure  7.44). 
The  delay  cells  differ  in  the  functional  dependence  of  tp  on  circuit  components  and 
tuning  voltage.  We  will  derive  the  f0  and  Kvco  for  both  classes. 


7.7.2.1     Kvco: Delay Cell with Active Load

We start with the class whose delay cell uses an active load. Using Figure 7.42 as an
example, at the Vinv node,

$$I_{load} = \frac{C_l V_{swing}}{t_p} \tag{7.79}$$

Here Cl is the load capacitor at Vinv, and Vswing is Vinv's voltage swing. Substituting
(7.79) into (7.77), we have

$$f_0 = \frac{I_{load}}{2 N C_l V_{swing}} \tag{7.80}$$

Taking differentials on both sides, we have

$$df_0 = \frac{dI_{load}}{2 C_l V_{swing} N} \tag{7.81}$$

From Figure 7.42, Iload = Itune and so dIload = dItune. Hence

$$df_0 = \frac{dI_{tune}}{2 C_l V_{swing} N} \tag{7.82}$$

Fig. 7.48 Simplified slow-slewing saturated delay cell circuit schematic

Finally, substituting (7.82) into (7.78),

$$K_{VCO} = 2\pi \frac{df_0}{dV_{tune}} = 2\pi \frac{dI_{tune}}{2 C_l V_{swing} N \, dV_{tune}} = \frac{2\pi G_m}{2 C_l V_{swing} N} \tag{7.83}$$

The last equality follows from the definition Gm = dItune/dVtune, where Gm is
the transconductance of the tuning transistors M5, M6.
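As a rough numerical illustration of (7.80) and (7.83), the sketch below evaluates f0 and Kvco for an active-load delay cell and cross-checks Kvco against the definition (7.78) by a finite difference. All component values, and the function names, are hypothetical and chosen only for illustration; they do not come from Figure 7.42.

```python
import math


def f0_active_load(i_load, c_l, v_swing, n_stages):
    """Oscillation frequency per (7.80): f0 = I_load / (2 N C_l V_swing)."""
    return i_load / (2.0 * n_stages * c_l * v_swing)


def kvco_active_load(g_m, c_l, v_swing, n_stages):
    """VCO gain per (7.83), in rad/s per volt."""
    return 2.0 * math.pi * g_m / (2.0 * c_l * v_swing * n_stages)


# Hypothetical design values, for illustration only.
N_STAGES = 4
C_L = 50e-15      # load capacitance at Vinv, farads
V_SWING = 1.0     # voltage swing at Vinv, volts
I_TUNE = 100e-6   # tuning current (equal to I_load), amperes
G_M = 200e-6      # transconductance dI_tune/dV_tune, siemens

f0 = f0_active_load(I_TUNE, C_L, V_SWING, N_STAGES)
kvco = kvco_active_load(G_M, C_L, V_SWING, N_STAGES)

# Cross-check (7.83) against the definition (7.78) with a finite difference:
# a small step D_V in Vtune changes Itune by G_M * D_V.
D_V = 1e-3
f0_stepped = f0_active_load(I_TUNE + G_M * D_V, C_L, V_SWING, N_STAGES)
kvco_numeric = 2.0 * math.pi * (f0_stepped - f0) / D_V

print(f"f0 = {f0 / 1e6:.1f} MHz")
print(f"Kvco from (7.83)       = {kvco:.4e} rad/s/V")
print(f"Kvco from finite diff. = {kvco_numeric:.4e} rad/s/V")
```

Because f0 is linear in Itune for this class of delay cell, the closed form and the finite difference should agree to within floating-point error.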


7.7.2.2    Kvco:  Delay  Cell  with  Resistive  Load 

Next we turn to the class whose delay cell uses a resistive load. This is the class used
for the rest of this book. We use Figure 7.44 as an example to facilitate discussion.
We redraw Figure 7.44 in Figure 7.48, where loads M3, M4 are represented by real
resistors R1, R2. C1, C2 are load capacitors (from device and parasitic capacitance)
hanging at Vout+, Vout-.

First let us derive f0. To derive this we know from (7.77) that we need tp. Hence
we now derive tp, which is the time between the zero crossings of Vout. When
do zero crossings occur? As an example, when Vin is positive, Vin switches Itail from
M2 to M1. Hence

$$V_{out+} \text{ goes from } V_{dd} \text{ to } V_{dd} - V_{swing} \tag{7.84}$$

$$V_{out-} \text{ goes from } V_{dd} - V_{swing} \text{ to } V_{dd} \tag{7.85}$$

The zero crossing occurs when

$$V_{out+} = V_{out-} = V_{dd} - V_{swing}/2 \tag{7.86}$$

Fig. 7.49 Left-hand side of the source coupled pair in a slow-slewing saturated delay cell during switching (schematic labels: R1 = R, Itail, Vout+, C1 = C)


Let us calculate the time between the zero crossings. First we note that for any
first-order RC circuit with output voltage Vout, we have

$$V_{out}(t) = V_{out}(\text{final}) + [V_{out}(\text{initial}) - V_{out}(\text{final})] \exp(-t/RC) \tag{7.87}$$

Let us now redraw the left-hand side of the circuit in Figure 7.48 in Figure 7.49.
Here we assume that Vin is positive so that all of Itail is switched to M1. Hence we
replace M1 by a current source with value Itail. Next we assume that R1 = R2 = R and
C1 = C2 = C.

Focusing on Figure 7.49, we note that it is a first-order RC circuit and we can use
(7.87) to find Vout+. To apply (7.87) we need to find Vout+(initial) and Vout+(final).
From (7.84) we know that

$$V_{out+}(\text{initial}) = V_{dd} \tag{7.88}$$

$$V_{out+}(\text{final}) = V_{dd} - V_{swing} = V_{dd} - I_{tail} R \tag{7.89}$$

Substituting (7.88) and (7.89) into (7.87), we have

$$V_{out+}(t) = V_{dd} - V_{swing} + (V_{dd} - V_{dd} + V_{swing}) \exp(-t/RC) = V_{dd} - V_{swing}(1 - \exp(-t/RC)) \tag{7.90}$$

From (7.86), zero crossing occurs when

$$V_{out+}(t) \text{ reaches } V_{dd} - V_{swing}/2 \tag{7.91}$$

Substituting (7.91) into (7.90), we have

$$V_{dd} - V_{swing}/2 = V_{dd} - V_{swing}(1 - \exp(-t_p/RC)) \tag{7.92}$$

Solving, we have

$$t_p = RC \ln 2 \tag{7.93}$$

Fig. 7.50 Right-hand side of the source coupled pair in a slow-slewing saturated delay cell during switching


To be consistent, let us rederive this by using the right-hand side of the circuit.
We redraw the right-hand side of the circuit in Figure 7.48 in Figure 7.50.
Remember that we have assumed that Vin is positive and so M2 is off. Since M2 is
off, it is not shown in Figure 7.50.
Focusing on Figure 7.50, again we note that it is a first-order RC circuit and
hence we can apply (7.87) to find Vout-. From (7.85) we have

$$V_{out-}(\text{initial}) = V_{dd} - V_{swing} \tag{7.94}$$

and

$$V_{out-}(\text{final}) = V_{dd} \tag{7.95}$$

Substituting (7.94) and (7.95) into (7.87) we have

$$V_{out-}(t) = V_{dd} + (V_{dd} - V_{swing} - V_{dd}) \exp(-t/RC) = V_{dd} - V_{swing} \exp(-t/RC) \tag{7.96}$$

From (7.86), zero crossing occurs when

$$V_{out-}(t) \text{ reaches } V_{dd} - V_{swing}/2 \tag{7.97}$$

Substituting (7.97) into (7.96), we have

$$V_{dd} - V_{swing}/2 = V_{dd} - V_{swing} \exp(-t_p/RC) \tag{7.98}$$

Solving again we have:

$$t_p = RC \ln 2 \tag{7.99}$$

Now that we have derived tp, let us go back and substitute (7.93) or (7.99) into
(7.77) to get f0:

$$f_0 = \frac{1}{2 N R C \ln 2} \tag{7.100}$$
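The result tp = RC ln 2 can be checked numerically. The following sketch evaluates the two exponential waveforms (7.90) and (7.96), finds by bisection the time at which they cross, and compares it with RC ln 2. The supply, swing, R, C, and N values are hypothetical and serve only as an illustration.

```python
import math


def vout_plus(t, vdd, v_swing, rc):
    """Falling output per (7.90): starts at Vdd, settles at Vdd - Vswing."""
    return vdd - v_swing * (1.0 - math.exp(-t / rc))


def vout_minus(t, vdd, v_swing, rc):
    """Rising output per (7.96): starts at Vdd - Vswing, settles at Vdd."""
    return vdd - v_swing * math.exp(-t / rc)


def zero_crossing_time(vdd, v_swing, rc):
    """Bisection search for the time at which Vout+ equals Vout-."""
    lo, hi = 0.0, 10.0 * rc
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if vout_plus(mid, vdd, v_swing, rc) > vout_minus(mid, vdd, v_swing, rc):
            lo = mid   # the two outputs have not crossed yet
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical values, for illustration only.
VDD = 1.8        # volts
V_SWING = 0.5    # volts
R = 10e3         # ohms
C = 50e-15       # farads
N_STAGES = 4

t_p = zero_crossing_time(VDD, V_SWING, R * C)
f0 = 1.0 / (2.0 * N_STAGES * t_p)          # (7.77)

print(f"t_p (bisection) = {t_p:.4e} s")
print(f"RC ln 2         = {R * C * math.log(2.0):.4e} s")
print(f"f0              = {f0 / 1e6:.1f} MHz")
```

Note that the crossing time does not depend on Vdd or Vswing, only on the RC product, which is what (7.93) and (7.99) state.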

Next we want to derive Kvco. To find Kvco, take differentials on both sides of
(7.100):

$$df_0 = -\frac{1}{2 N R^2 C \ln 2} \, dR \tag{7.101}$$

To find f0's dependence on Itune, we express dR in terms of dItune:

$$df_0 = -\frac{1}{2 R^2 N C \ln 2} \frac{dR}{dI_{tune}} \, dI_{tune} = -\frac{f_0}{R} \frac{dR}{dI_{tune}} \, dI_{tune} \tag{7.102a}$$

Alternatively, (7.102a) can be expressed in terms of dVtune:

$$df_0 = -\frac{1}{2 R^2 N C \ln 2} \frac{dR}{dI_{tune}} \, G_m \, dV_{tune} = -\frac{f_0}{R} \frac{dR}{dI_{tune}} \, G_m \, dV_{tune} \tag{7.102b}$$

Here Gm is the transconductance of the Vtune-to-Itune converter. This
converter was shown in Figure 7.44. Referring to Figure 7.44, Gm is simply the
gm of transistor Mtune1, denoted as gmMtune1 (we assume the current
mirror Mtune2-Mtune3 has a gain of 1). How about dR/dItune? To find this, again we refer to
Figure 7.44. There are two cases: the first is when Vbias is a constant, and the second
is when Vbias varies in a way that maintains a constant Vswing across M3 and M4.

Case 1: Vbias = constant

Since M3, M4 are biased in the triode region, we know that

$$R = \frac{1}{k_p (W/L)_3 (V_{gs3} - V_t - V_{ds3})} \tag{7.103}$$

Differentiating (7.103) with respect to Itune and noting that dVds3/dItune = R, we
have

$$\frac{dR}{dI_{tune}} = \frac{R^2}{V_{gs3} - V_t - V_{ds3}} \tag{7.104}$$

Substituting (7.104) into (7.102a),

$$\frac{df_0}{dI_{tune}} = \frac{-f_0 R}{V_{gs3} - V_t - V_{ds3}} \tag{7.105a}$$

Substituting (7.104) into (7.102b),

$$\frac{df_0}{dV_{tune}} = \frac{-f_0 R \, g_{mMtune1}}{V_{gs3} - V_t - V_{ds3}} \tag{7.105b}$$

Substituting (7.105b) into (7.78), we have

$$K_{VCO} = \frac{-2\pi f_0 R \, g_{mMtune1}}{V_{gs3} - V_t - V_{ds3}} = \frac{-2\pi f_0 R \, g_{mMtune1}}{V_{GS3} - V_t - V_{DS3}} \tag{7.106}$$

The last equality holds because we assume that Vgs3, Vds3 do not change much.
Substituting (7.103) in (7.106),

$$K_{VCO} = \frac{-2\pi f_0 \, g_{mMtune1}}{k_p (W/L)_3 (V_{GS3} - V_t - V_{DS3})^2} \tag{7.107}$$

Case 2: Vswing is a constant.

Vswing is set constant by a circuit called the replica bias. The idea is that we use a
feedback loop that sets f0 to be constant, irrespective of temperature, power supply,
and process variation. We call this coarse tuning. This can also be used to do fine
tuning: that is, changing f0 as a function of Vtune. In practice, this poses some
additional problems because it will put the replica bias circuit inside the PLL.
Hence extra poles introduced by this circuit will compromise the dynamics of the PLL.
If we do indeed choose to use the replica bias circuit to perform fine tuning, then we
can also derive Kvco with this method.
First we note that, by design,

$$V_{swing} = R \, I_{tail} = \text{constant} \tag{7.108}$$

Differentiating (7.108) with respect to Itune,

$$\frac{dR}{dI_{tune}} = -\frac{R}{I_{tune}} \tag{7.109}$$

Substituting (7.109) into (7.102b) and simplifying,

$$\frac{df_0}{dV_{tune}} = \frac{f_0}{I_{tune}} \, g_{mMtune1} \tag{7.110}$$

Substituting (7.110) into (7.78), we finally have

$$K_{VCO} = \frac{2\pi f_0 \, g_{mMtune1}}{I_{tune}} \tag{7.111}$$
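The two Kvco expressions for the resistive-load cell can be compared numerically. The sketch below evaluates (7.107) for Case 1 and checks it against (7.106), then evaluates (7.111) for Case 2 and checks it against a finite-difference derivative of (7.100). It assumes that the tail current equals Itune in Case 2, so that R = Vswing/Itune; that assumption, the device values, and the function names are all hypothetical and used only for illustration.

```python
import math

LN2 = math.log(2.0)


def f0_resistive(r, c, n_stages):
    """Oscillation frequency per (7.100)."""
    return 1.0 / (2.0 * n_stages * r * c * LN2)


def triode_resistance(k_p, w_over_l, v_gs, v_t, v_ds):
    """Triode load resistance per (7.103)."""
    return 1.0 / (k_p * w_over_l * (v_gs - v_t - v_ds))


def kvco_case1(f0, g_m, k_p, w_over_l, v_gs, v_t, v_ds):
    """Kvco with constant Vbias, per (7.107)."""
    return -2.0 * math.pi * f0 * g_m / (k_p * w_over_l * (v_gs - v_t - v_ds) ** 2)


def kvco_case2(f0, g_m, i_tune):
    """Kvco with constant Vswing (replica bias), per (7.111)."""
    return 2.0 * math.pi * f0 * g_m / i_tune


# Hypothetical values, for illustration only.
N_STAGES = 4
C = 100e-15                      # load capacitance, farads
G_M = 200e-6                     # gm of the Vtune-to-Itune converter, siemens
K_P, W_OVER_L = 100e-6, 10.0     # process gain (A/V^2) and W/L of the load
V_GS, V_T, V_DS = 1.5, 0.5, 0.3  # load bias magnitudes, volts
V_SWING, I_TUNE = 0.5, 100e-6    # constant swing (V) and tuning current (A)

# Case 1: constant Vbias.
r1 = triode_resistance(K_P, W_OVER_L, V_GS, V_T, V_DS)
f0_1 = f0_resistive(r1, C, N_STAGES)
k1 = kvco_case1(f0_1, G_M, K_P, W_OVER_L, V_GS, V_T, V_DS)
k1_via_7106 = -2.0 * math.pi * f0_1 * r1 * G_M / (V_GS - V_T - V_DS)


# Case 2: constant Vswing, with R = Vswing / Itune (assumes Itail = Itune).
def f0_case2(i_tune):
    return f0_resistive(V_SWING / i_tune, C, N_STAGES)


f0_2 = f0_case2(I_TUNE)
k2 = kvco_case2(f0_2, G_M, I_TUNE)
D_V = 1e-3                       # small step in Vtune, volts
k2_numeric = 2.0 * math.pi * (f0_case2(I_TUNE + G_M * D_V) - f0_2) / D_V

print(f"Case 1: Kvco = {k1:.4e} rad/s/V")
print(f"Case 2: Kvco = {k2:.4e} rad/s/V (finite difference {k2_numeric:.4e})")
```

The opposite signs of the two results reflect the signs in (7.107) and (7.111).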

As with Kvco of an LC oscillator, Kvco of a ring oscillator should also be designed
to be as small as possible. Of course, it must also be large enough that f0 can span the
whole frequency range. Why is a small Kvco also desirable in the case of a ring
oscillator? In this case, since the propagation delay is a function of Cl, Vswing, and
Ibias, we can tune using Cl (as in the varactor case), using Vswing (by changing the
load resistance), or using Ibias. It turns out that tuning using Ibias is the more
convenient choice. In the case of tuning by varying Ibias, the thermal noise in Ibias
will degrade the phase noise. The degradation becomes worse as Kvco becomes
larger; hence it is also desirable to keep Kvco small.


7.8     Introduction  to  Phase  Noise 
7.8.1     Definition 

What is phase noise? We briefly mentioned it in Chapter 2 and compared it to
amplitude noise. Since we are familiar with amplitude noise, we start by repeating
its definition. From Chapter 2, we know that we can describe voltage noise (a type
of amplitude noise) by

$$\overline{V^2} = S_V(f) \, \Delta f \tag{7.112}$$

Here $S_V(f)$ is the power spectral density in V²/Hz and $\overline{V^2}$ is the average of the
square of the noise voltage. The unit of measurement for $\overline{V^2}$ is volt². When we extend
this definition to the definition of phase noise, we have

$$\overline{\theta^2} = S_\theta(f) \, \Delta f \tag{7.113}$$

Here $S_\theta(f)$ is the power spectral density (PSD) of phase at modulation frequency
f, and $\overline{\theta^2}$ is the average of the square of phase deviation for frequencies
from f - Δf/2 to f + Δf/2. The unit for $\overline{\theta^2}$ is rad² or cycles² and for $S_\theta$ is rad²/Hz
or cycles²/Hz. For example, if $S_\theta$ is 8 cycles²/Hz, the average of the square of phase
deviation in a 2-Hz band is 16 cycles². If we pass such a signal through a phase
detector, it will produce a current power spectrum from a phase power spectrum and
phase noise will be converted into current noise.


7.8.2    Interpretation 

In this sub-section we clear up a subtle point that relates phase noise and amplitude
(voltage or current) noise. As a start, let us assume that the oscillator has a
sinusoidal output So(t) (like the one given in Figure 7.20) that is now corrupted
by phase noise. Hence the form becomes

$$S_o(t) = \cos(\omega_c t + \Delta\theta(t)) \tag{7.114}$$

Here ωc is the oscillation or carrier frequency, and Δθ(t) is the random phase
fluctuation due to noise inside the oscillator. We assume that the amplitude of So(t)
is 1. Let us further assume that Δθ(t) has a spectral density, denoted as Δθ(f), as
shown in Figure 7.51. (|Δθ(f)|² is, of course, the PSD of the phase noise.) This is a
baseband spectrum and plots the phase variable, Δθ(f), as a function of
frequency. Next we represent this spectral density by individual frequency
impulses. As an example, let us refer to Figure 7.52, where two pairs of the impulses
are shown, at frequencies ±fm1, ±fm2. Using the principle of superposition, let us
apply each of these individual pairs of frequency impulses to (7.114). We start with
the pair Δθ(fm1), Δθ(-fm1), with frequency impulses at ±fm1. They are represented
in the time domain as:

$$\Delta\theta_{fm1}(t) = A_{\Delta\theta}(f_{m1}) \cos(\omega_{m1} t) \tag{7.115}$$

We then substitute (7.115) in (7.114):

$$S_o(t) = \cos(\omega_c t + A_{\Delta\theta}(f_{m1}) \cos(\omega_{m1} t)) \tag{7.116}$$

Fig. 7.51 Oscillator phase noise spectral density, phase representation

Fig. 7.52 Equivalent representation of oscillator phase noise spectral density, phase representation, using frequency impulses at ±fm1, ±fm2

We can see that (7.116) represents phase modulation. Next we assume that the
phase fluctuation is small and so Δθfm1(t) is small, which means that AΔθ(fm1) is
small as well. Hence this phase modulation is a narrowband phase modulation,
which can be approximated by amplitude modulation. Accordingly, So(t) in (7.116)
consists of AΔθ(fm1) cos((ωc ± ωm1)t) and cos(ωc t) [15]. This is shown in Figure
7.53. We can see that they become bandpass signals, centered at fc. Moreover, the
phase variable Δθ(f) is transformed to an amplitude variable, So(f). Looking at
Figure 7.53, we can see that So(f) consists of the carrier at fc and two frequency
impulses spaced ±fm1 from fc, with the amplitudes being AΔθ(fm1).

Next let us repeat the procedure for the pair Δθ(fm2), Δθ(-fm2), the frequency
impulses at ±fm2, as shown in Figure 7.52. Then (7.114) becomes

$$S_o(t) = \cos(\omega_c t + A_{\Delta\theta}(f_{m2}) \cos(\omega_{m2} t)) \tag{7.117}$$

Fig. 7.53 Frequency impulses at the oscillator output due to frequency impulses injected at ±fm1

Fig. 7.54 Frequency impulses at the oscillator output due to frequency impulses injected at ±fm1, ±fm2. The envelope formed by connecting the peaks of these frequency impulses at the oscillator output becomes the oscillator phase noise spectral density, amplitude representation


This is again narrowband phase modulation and hence So(t) consists of the
carrier at fc and two frequency impulses spaced ±fm2 away from fc, with the
amplitudes being AΔθ(fm2). If we add these new frequency impulses to Figure
7.53, we have Figure 7.54. Furthermore, let us connect the peaks of these frequency
impulses by two dotted lines, as shown in Figure 7.54. These dotted lines are called
the envelope. Now let us compare the envelope in Figure 7.54 with the spectral
density in Figure 7.52. We note that they have the same shape. The difference, of
course, is that Figure 7.52 is a baseband plot of the spectral density of
the phase variable, whereas Figure 7.54 is a bandpass (or RF) plot of the
spectral density of the amplitude variable. Strictly speaking, Figure 7.52 is the
spectral density of the true phase noise. However, because the plot in Figure 7.54
has the same shape as that in Figure 7.52, it can be used to represent indirectly the
spectral density of the phase noise as well. Moreover, Figure 7.54 represents what
we can actually measure (amplitude). From now on, we will use either representation
as suited and we will use Sθ_vco(f) to denote the PSD of both representations of
the phase noise in a VCO. The unit in the phase domain representation is usually given
in rad²/Hz, while the unit in the amplitude domain representation is usually given in
dBc/Hz. Which representation it refers to should be evident from the context in
which Sθ_vco(f) is used.
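The narrowband argument can be illustrated numerically. The sketch below phase-modulates a unit-amplitude carrier with a single small tone, as in (7.116), and measures the spectrum at the carrier and at the two sideband frequencies with a directly computed DFT. The carrier bin, modulation bin, and peak phase deviation are hypothetical values. In the standard narrowband result each sideband has about half the peak phase deviation, so the sideband height is proportional to AΔθ(fm1), which is the property the envelope argument relies on.

```python
import math


def dft_amplitude(samples, k):
    """Amplitude of the sinusoidal component at DFT bin k of a real sequence."""
    n = len(samples)
    re = sum(s * math.cos(2.0 * math.pi * k * i / n) for i, s in enumerate(samples))
    im = sum(s * math.sin(2.0 * math.pi * k * i / n) for i, s in enumerate(samples))
    return 2.0 * math.hypot(re, im) / n


# Hypothetical values, for illustration only.
N_SAMPLES = 1024
FC_BIN = 100      # carrier frequency, in DFT bins
FM_BIN = 7        # modulation (offset) frequency, in DFT bins
BETA = 0.01       # peak phase deviation in rad, playing the role of A_dtheta(fm1)

# Phase-modulated carrier per (7.116), sampled over an integer number of periods.
signal = [
    math.cos(2.0 * math.pi * FC_BIN * i / N_SAMPLES
             + BETA * math.cos(2.0 * math.pi * FM_BIN * i / N_SAMPLES))
    for i in range(N_SAMPLES)
]

carrier = dft_amplitude(signal, FC_BIN)
upper = dft_amplitude(signal, FC_BIN + FM_BIN)
lower = dft_amplitude(signal, FC_BIN - FM_BIN)
elsewhere = dft_amplitude(signal, FC_BIN + 3)   # a bin with no expected component

print(f"carrier        = {carrier:.6f}")
print(f"upper sideband = {upper:.6f}")
print(f"lower sideband = {lower:.6f}")
print(f"unrelated bin  = {elsewhere:.2e}")
```

Doubling BETA should roughly double both sidebands while leaving the carrier almost unchanged, which is why the sideband envelope tracks the shape of the phase spectral density.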


7.8.3    Basic  Phase  Noise  Models 

In  this  sub-section  we  discuss  the  phase  noise  behavior  of  some  specific  VCOs. 
Phase  noise  comes  from  the  noise  (both  thermal  and  shot  noise)  of  the  internal 
transistors and resistors. A rough first-order estimate of the phase noise of a resonator-
based VCO, such as the LC tank VCO, can be carried out by continuing to assume
the VCO to be an LTI system. We will have a chance to do that in Problem 7.7.
A more sophisticated treatment of phase noise calls for treating the resonator-based
VCO as an LPTV system, similar to what has been done in the mixer noise analysis in
Chapter 4 and Chapter 5 and in the simulator SpectreRF. For waveform oscillators
such  as  a  ring  oscillator,  phase  noise  is  best  calculated  in  the  time  domain  and  then 
its  frequency  response  is  calculated  using  the  autocorrelation  method  [4].  We  will 
now  discuss  the  phase  noise  of  ring  oscillators. 

Let  us  first  compare  the  phase  noise  of  the  delay  cells  in  a  ring  oscillator.  For  the 
fast-slewing  saturated  delay  cell  of  Figure  7.42,  the  noise  process  is  not  stationary 
since  the  devices  change  state.  First,  each  MOS  device  turns  off  completely  once  in 
every  cycle  of  the  ring  oscillator  and  no  noise  is  generated  for  those  periods. 
Second, when the individual MOS device turns on, the noise process affects the
delay time of each cell up to the time instant when Vout crosses Vthreshold of
the next delay cell. Again that is because once Vout crosses Vthreshold, the inverter
of the next delay cell switches state.

For the slow-slewing saturated delay cell of Figure 7.44, full switching also
occurs and the noise process is still non-stationary. However, the noise process
affects the delay long after the time instant when Vout crosses Vthreshold [4,16].

Finally,  for  the  non-saturated  delay  cell  of  Figure  7.46,  because  transistors  are 
never  turned  off,  the  noise  process  can  be  approximated  as  a  stationary  process.  For 
calculation of noise, each delay cell can be modeled as a linear amplifier. Comparing
the three classes of ring oscillators based on the three types of delay cells, the
one in Figure 7.46 has the worst phase noise performance. In general, in terms of
phase noise performance the fast-slewing saturated type is better than the slow-slewing
saturated type, which in turn is better than the non-saturated type [16].

In this section let us concentrate on the ring oscillator based on the slow-slewing
saturated type. We assume that the noise is dominated by thermal noise in transistors
M1, M2 in this type of delay cell (shown in Figure 7.44). We first present the phase
noise PSD of this ring oscillator, which is given as [4]

Se_vco(./i)  =  (fj)         f"^      .  (7.118)

Here N is the number of stages, fi is the frequency of interest (as an offset from f0),
and f0 is the oscillator frequency. For the rest of the parameters let us refer to Figure
7.44. av is the voltage gain of the differential pair M1, M2. η is a constant that
depends on the relative amount of time that transistor M1 is in the saturation and triode
regions. Ibias is the bias current, and Vgs1 - Vt is the overdrive voltage of M1. Notice
that Sθ_vco(fi) has bandpass characteristics around ω0 = 2π × f0. This shape is
called Lorentzian and falls off as 1/fi² from the center frequency.

From (7.118) notice that as the power dissipated in the delay cell goes up, Ibias times
(Vgs1 - Vt) goes up and Sθ_vco(fi) goes down. Hence there is a trade-off between
power and phase noise.

Now this oscillator is embedded inside a synthesizer, so to calculate the PSD of
the synthesizer output phase noise due to VCO phase noise, denoted as Sθo_vco(fi),
we need to know the transfer function from the VCO to the synthesizer output,
which we denote as Hvco. Using Hvco, Sθo_vco(fi) can be expressed as

$$S_{\theta o\_vco} = |H_{vco}|^2 \, S_{\theta\_vco} \tag{7.119}$$

Hvco depends on the loop filter. The loop filter will be discussed in Chapter 8.


7.9     Advanced  Phase  Noise  Model  of  Ring  Oscillators 

Let  us  return  to  Figure  7.40,  which  shows  an  N-stage  ring  oscillator  consisting  of 
N identical delay cells. Elaborating on section 7.7.1, and also following [2, 3], ring
oscillators can be categorized as having saturated or unsaturated outputs. Ring
oscillators with saturated outputs are those whose delay cells' outputs reach the
power supply Vdd within the period of oscillation. Ring oscillators with unsaturated
outputs are those whose delay cells' outputs never reach Vdd within the period of
oscillation. Typically, in ring oscillators with saturated outputs the transistors in
each delay cell are fully switched. However, in ring oscillators with unsaturated
outputs, depending on the delay cells' design, either all the transistors in each delay
cell remain on all the time and never turn off, or in each delay cell some of the
transistors never turn off while some of the transistors are fully switched. In the case
when all the transistors remain on all the time, the ring oscillator is denoted as a
non-saturated ring oscillator. In the case when some of the transistors never turn off
while some transistors are fully switched, the ring oscillator is denoted here
as an unsaturated oscillator. This categorization is shown in Figure 7.55.


7.9.1     Saturated  Ring  Oscillators 

In the past, ring oscillators with saturated outputs have been reported to have
better phase noise characteristics than ring oscillators with unsaturated outputs [3].
This is because once the output reaches Vdd in a cycle, the noise is reset and there is no
correlation in noise to the next cycle. One way to make an oscillator saturated is to
increase the number of stages, N, in the ring oscillator. As N increases, the oscillator
outputs eventually become saturated. This is because once the output crosses the
threshold and triggers the next stage, it has time to ramp up to Vdd before the change
propagates through the chain of delay cells, feeds back to the input of the delay
cell, and makes the output ramp down again. Increasing N, however, increases
hardware and/or decreases the oscillation frequency.

Fig. 7.55 Categorization of Oscillators

To look at the phase noise of a saturated ring oscillator we start with section 7.8
above. There we presented some basic phase noise theory. Assumptions were made to
simplify the analysis. These assumptions affect the accuracy of the model as well as
the design insights.

To develop a more advanced phase noise model, we state these assumptions
explicitly. For illustration we use the single-ended oscillator shown above
in Figure 7.40, whose delay cells are simple inverters, each consisting of transistors
M1 and M2. We focus on the rising transition of Vout, for which an example simulation
is shown in Figure 7.57. Vout around the moment of switching is zoomed in and
noise is visible. td is divided into two regions: t1 (M1 in triode) and t2 (M1 in
saturation). An expression for the phase noise of this single-ended oscillator, with
simple inverter delay cells, was derived with the following assumptions [12]:

1. When Vout crosses the threshold, it crosses the threshold only one time.

2. Vin switches instantly to 0 V and stays constant during the rising transition of
Vout, so that M1 is replaced by a current source (Figure 7.56a), whose mean
(deterministic part) is constant. The noise part of the current source has constant
power spectral density (psd).

3. Referring to transistors M1, M2, during the rising transition it is assumed that M1
is on and in saturation throughout, while M2 is off, as shown in Figure 7.56a; i.e.,
it essentially ignores the fact that M1 typically enters triode.

The consequences of these assumptions are now discussed.

Fig. 7.56 Representation for simple inverter based ring oscillator

Fig. 7.57 Noise simulation (including device capacitance) of an example differential pair based ring oscillator with physical resistor load

A) Assumption 1: last crossing

Specifically, [28] shows that assumption (1) (this assumption was made, for
example, in [7]) can, by itself, underestimate the phase noise by a factor
of two. This is because, with noise, Vout can cross the threshold more than once,
as shown in the circle labelled multiple crossing at the upper right-hand corner of
Figure 7.57, where the waveform around the threshold crossing is enlarged so that
the noise is visible. Vout crosses the threshold for the first time at t_first. It then
bounces around the threshold a couple of times before making the last crossing at
t_last, which is the actual switching time. Because only the last crossing
counts, these extra crossings should be included in calculating the timing jitter.
They increase timing jitter and hence phase noise.

A  bound  for  this  additional  jitter  can  be  shown  to  be  related  to  the  current 
noise/slew  rate  ratio  [29].  With  this  ratio  reduced  to  below  Pvy  z ,  and  for  the 
typical CMOS technology, the additional phase noise is reduced to a
value small enough to be ignored.
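The effect of multiple threshold crossings can be illustrated with a small Monte Carlo experiment. The sketch below is not the model of [28] or [29]: it simply adds independent Gaussian noise samples to an ideal ramp (a deliberately crude, hypothetical noise model) and records the first and last upward threshold crossings in each trial. The slew rate, noise level, threshold, and step size are likewise hypothetical.

```python
import random


def first_and_last_crossing(slew_rate, noise_rms, threshold, dt, n_steps, rng):
    """Return the first and last upward threshold crossing times of a noisy ramp."""
    first = last = None
    previous = rng.gauss(0.0, noise_rms)            # noisy sample at t = 0
    for n in range(1, n_steps + 1):
        current = slew_rate * n * dt + rng.gauss(0.0, noise_rms)
        if previous < threshold <= current:         # upward crossing in this step
            if first is None:
                first = n * dt
            last = n * dt
        previous = current
    return first, last


# Hypothetical values, for illustration only.
SLEW_RATE = 1.0     # volts per unit time
NOISE_RMS = 0.02    # volts
THRESHOLD = 0.5     # volts
DT = 1e-3           # time step
N_STEPS = 1000      # the ramp ends at 1.0 V, well above the threshold
N_TRIALS = 200

rng = random.Random(0)   # fixed seed so that the run is repeatable
results = [
    first_and_last_crossing(SLEW_RATE, NOISE_RMS, THRESHOLD, DT, N_STEPS, rng)
    for _ in range(N_TRIALS)
]

mean_first = sum(first for first, _ in results) / N_TRIALS
mean_last = sum(last for _, last in results) / N_TRIALS
multiple = sum(1 for first, last in results if last > first)

print(f"trials with more than one crossing: {multiple} of {N_TRIALS}")
print(f"mean first crossing time: {mean_first:.4f}")
print(f"mean last crossing time:  {mean_last:.4f}")
```

Under this toy model, an analysis that assumes a single crossing would use the first crossing, whereas the switching instant is set by the last one, so both the mean delay and its spread are misestimated.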

B)  Assumption  2:  time  scaling 

As for assumption 2, one consequence of making this assumption is that one can
over-estimate or under-estimate the time-varying slew rate and noise power
spectral density, which change with Vin and Vout. For example, the slew rate
is not constant, as shown in Figure 7.57. [6] does try to incorporate the
time-varying effect, in a heuristic manner, by making the timing jitter (per delay cell)
equal to the ratio of the noise to the slew rate, both calculated at the mean time of
crossing. However, as shown in [29], this heuristic way of handling time
variation is problematic since any history of the noise up to the mean time
of crossing is ignored, which can significantly change the answer. [27] handles
assumption (2) by the concept of time scaling.

Let us review the concept of time scaling by using the region t2 in Figure 7.57
as an example. The resulting approximated output voltage, denoted as power
law approx, is shown as a dotted line in region t2, with time dependency written
as t^(2p). All parameters are subscripted with "2" because M1 is in region t2.

Let us now redraw the power law approx in Figure 7.58a. The time dependency
is assumed to be √t (i.e. p = 0.25). Focusing on SR2, because it is not
constant, the mean of Vout2, $\overline{V}_{out2}$, ramps up with a non-constant slope. To show
this we now label the time axis, t, as 0, 1, 2, 3. Notice the time spacing is
uniform, i.e. the change in time from t = 0 to t = 1 is identical to the change in time
from t = 1 to t = 2, which in turn equals the time change from t = 2 to t = 3,
where the time spacings are all 1 time unit. On the other hand, the change in
Vout2 over each corresponding time spacing is increasing. To see this, note that by
definition dVout2/dt = SR2(t). As stated above, SR2(t) is proportional to √t. For
illustration, we set the proportionality constant to 1. Then SR2(t) = √t. This
means the slew rate in region 2 is increasing with time (from SR2(1), at t = 1,
to SR2(3), at t = 3), as shown in Figure 7.58a.
Next we discretize the equation dVout2/dt = SR2(t) as ΔVout2(nΔt)/Δt = SR2(nΔt),
n = 1, 2, ..., where n is the time index. Then as the time marches from t = 0
to t = 1 to t = 2 ..., we calculate ΔVout2(nΔt) by setting Δt = 1, and thus
ΔVout2(nΔt) = ΔVout2(n) = SR2(n) × 1. Then at n = 1, ΔVout2 is
SR2(1) × 1 = √1 × 1 = 1 voltage unit. Next, at n = 2, ΔVout2 is
SR2(2) × 1 = √2 = 1.41 voltage units, and from t = 2 to t = 3 it is 1.73 voltage
units, and so on. Thus as time marches along the time index n, Vout2 ramps up
with a non-constant slope.

Next we scale the original time $t$ to a new time $t'$, as shown in Figure 7.58b.
The labels for the time axis are now superscripted with primes and become 1',
2', 3'. In the new time, the time spacing is increasing. Thus the change in time from
$t' = 0'$ to $t' = 1'$ equals 1 time unit, and the change in time from $t' = 1'$ to $t' = 2'$
equals 1.41 time units. In turn, the change in time from $t' = 2'$ to $t' = 3'$ equals 1.73
time units. This increase exactly matches the
change in $V_{out2}$ in the original time. This also means the slew rate $SR_2'$ (primed
to indicate time scaled) is now constant (i.e., $SR_2'(1')$, at $t' = 1'$, is the same as
$SR_2'(3')$, at $t' = 3'$). This continues until $M_1$ hits threshold at time $t_2'$. $t_2'$ has
mean $\bar{t}_2'$ and jitter $\Delta t_2'$.
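
The effect of the time scaling can be illustrated with the same numbers. This is a sketch under the assumptions of the illustration (slew rate equal to $\sqrt{t}$, unit steps in the original time); the helper names are hypothetical.

```python
import math

def scaled_time_spacings(n_steps):
    """Length of the n-th interval in scaled time t', taken as sqrt(n)."""
    return [math.sqrt(n) for n in range(1, n_steps + 1)]

def voltage_increments(n_steps):
    """Change in V_out2 over the n-th unit interval of original time."""
    return [math.sqrt(n) * 1.0 for n in range(1, n_steps + 1)]

spacings = scaled_time_spacings(3)
dv = voltage_increments(3)

# Slew rate seen in scaled time: voltage change divided by scaled-time spacing.
sr2_scaled = [v / s for v, s in zip(dv, spacings)]
print(sr2_scaled)  # [1.0, 1.0, 1.0]
```

Because each scaled interval stretches by the same amount as the voltage step it contains, the slope in scaled time comes out constant.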

So far we have concentrated on $SR_2(t)$. A similar argument applies to the
current noise power spectral density, $S_{in2}$, which is responsible for the noise
part of $V_{out2}$. Working out the mathematics, it was shown in [27] that, in the new
time scale $t'$, $S_{in2}'(t')$ (primed to indicate time scaled) is also constant.

In summary, whereas in time $t$ the slew rate and noise psd were functions of time, in
scaled time $t'$ they become constants. Thus, in scaled time, assumption 2 becomes
valid. Therefore $\bar{t}_2'$ and $\Delta t_2'$ can be calculated by simply dividing the barrier and
accumulated noise by the slew rate, respectively. If we assume the noise comes


from  thermal  noise,  then  from  [12,  27],  the  accumulated  noise  is  4- 
At2'  are: 
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^=-^-  >Afe'  =  _L_^_  =  _L ^7—  (7-120) 

-       5tf'2'     2       V2     SR'2         ^2      SR>2 

Upon rescaling back to the original time scale, $\Delta t_2'$ is stretched by a factor
relating to $\beta$ (= 0.25 in the present illustration; in general, the value of $\beta$ depends
mainly on the delay cell structure and whether the transistor is operating in
saturation or triode, and has little dependence on design parameters such as
current $I$, W/L ratios, etc.), and $V_{out2}(t)$ is obtained, as shown in [27], as:


J*s„ 


At2  =  [wj  [^~1]  x  {2p  + 1}  [^"1]  x  Ti  ^srT         (7'121) 

C)  Assumption  3:  multiple  threshold  crossing 

The consequence of assumption 3 is that with technology scaling, the power supply
is reduced. Hence the voltage headroom is reduced and $M_1$ can enter triode for
a portion of the operation. In triode, the transistor slew rate is reduced and phase
noise deteriorates. [26] handles assumption 3 by proposing a replica bias
design that sets $V_{swing}$ so that $M_1$ remains in saturation, if possible. If that
is not possible, [26] gives an analytical equation that predicts the phase noise,
including the effect of $M_1$ entering triode.

Let  us  elaborate  on  this. 

Returning to Figure 7.56a, let us take a look at $M_1$. $V_{in}$ is at 0 V. Initially $M_1$ is
in saturation, and $I = I_{sat}$, as shown in the $I$ versus $t$ diagram in Figure 7.56c.
$V_{out}$ increases with a constant slew rate, denoted as $SR_1$ in the $V_{out}$ versus $t$
diagram in Figure 7.56c ($SR_1$ in Figure 7.56c is the same as $SR$ in Figure 7.56b).
As $V_{out}$ increases and crosses the sat/triode value, $M_1$ enters the triode region.
However, since $|V_{ds}|$ of $M_1$ equals $V_{dd} - V_{out}$, with $M_1$ in triode, $I$ and the noise
change as $V_{out}$ changes. For assumption 2 to remain valid, we set $I$ and the noise
power spectral density (psd) equal to their respective average values over time
(note that the average referred to here is the time average, which is different from
the ensemble average; the ensemble average still varies with time). The time
average value of $I$ in the triode region is denoted as $I_{triode}$, as shown in the $I$
versus $t$ diagram. $I_{sat}$ is larger than $I_{triode}$. Thus as $M_1$ changes from saturation to
triode, $I$ drops from $I_{sat}$ to $I_{triode}$. As $I$ decreases, the slew rate decreases, so when
$M_1$ switches from saturation to triode, the slope of $V_{out}$ decreases from $SR_1$ to
$SR_2$, as shown in the $V_{out}$ versus $t$ diagram. Similarly, $S_{in}$ also changes when $M_1$
moves from saturation to triode.
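
The two-slope ramp described above can be sketched as a piecewise-linear function. This is only an illustration: it assumes the slew rate in each region is the current divided by a load capacitance C, that the switch between regions happens when the output reaches the threshold voltage, and the numerical values are hypothetical rather than taken from the text.

```python
def v_out_mean(t, i_sat, i_triode, c, v_t):
    """Mean V_out(t): slope I_sat/C up to V_t, then slope I_triode/C.

    No clamping at the supply is modeled; the sketch only covers the ramp.
    """
    sr1 = i_sat / c       # slew rate while M1 is in saturation
    sr2 = i_triode / c    # time-averaged slew rate while M1 is in triode
    t1 = v_t / sr1        # time at which V_out reaches the sat/triode boundary
    if t <= t1:
        return sr1 * t
    return v_t + sr2 * (t - t1)

# Hypothetical example values (not from the source).
I_SAT, I_TRIODE, C_LOAD, V_T = 100e-6, 50e-6, 100e-15, 0.4

for t in (0.2e-9, 0.4e-9, 0.6e-9):
    print(t, v_out_mean(t, I_SAT, I_TRIODE, C_LOAD, V_T))
```

With these values the slope halves once the output passes 0.4 V, mirroring the drop from the saturation slew rate to the triode slew rate.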

To reiterate, in Figure 7.56c, physically, when $M_1$ goes from saturation to
triode, there is a change in current and noise. The change from saturation to
triode happens at $V_{out} - V_{in} = V_t$, where $V_t$ is the transistor threshold voltage.
Since $V_{in}$ equals 0, this happens at $V_{out} = V_t$. Mathematically, $V_{out}$ can be viewed
as the state of the transition, which starts from 0 and increases toward $V_{dd}$.



Thus the change from the saturation to the triode region can be interpreted as a change
in state. A change in state, in turn, is governed mathematically by the Markov
property [28]. This essentially means that, as far as the transition is concerned,
what happens after this change in state is captured by what happens at only the
instant when $V_{out} = V_t$ (the sat/triode boundary). To elaborate, at the beginning of a region,
the inverter's voltages and currents are known. They specify the initial condition
(initial voltages and currents). Since the only noise source taken into account is
white, the uncertainty of the voltages and currents in that region is independent of
what happened before. Thus the circuit's voltages and currents in a region depend
only on the initial condition of the circuit at the beginning of that region and not
on earlier history. Therefore they possess the Markov property.

Because of noise, $V_{out}$ reaches the value $V_t$ at a random time, in the same
way that $V_{out}$ reaches the threshold of the next-stage inverter at a random time. The
randomness in the time for $V_{out}$ to reach $V_t$ is characterized by a probability
distribution (which possesses a standard deviation), in much the same way that the
randomness in the time for $V_{out}$ to reach the threshold is characterized by a
(different) probability distribution and standard deviation (which we have
denoted as the timing jitter). To repeat: when $V_{out}$ crosses
the threshold at $V_{dd}/2$, the following delay cell is switched, whereas when
$V_{out}$ crosses the value $V_t$, $M_1$ is switched from saturation to triode. Therefore $V_{out}$
crossing $V_t$ can also be viewed as crossing a threshold. To differentiate, we call
this an internal threshold. Just like the random time to threshold, which has a
mean as well as a standard deviation/timing jitter, the random time to this
internal threshold also has its own mean and its own standard deviation/timing
jitter. The mean time to the internal threshold is denoted as $\bar{t}_1$, the barrier
is $b_1$, and the timing jitter is $\Delta t_1$. We further label the transition of $V_{out}$ from 0 to
the internal threshold as the first leg of the transition.

Returning to the transition, after $V_{out}$ reaches the internal threshold it
continues to rise and eventually crosses the threshold. We label this as the
second leg of the transition. Like the first leg, which starts from a known state
($V_{out} = 0$) and reaches another known state ($V_{out} = V_t$), the second leg also starts
from a known state ($V_{out} = V_t$) and reaches another known state ($V_{out} = V_{dd}/2$).
The noise injected in the second leg affects its crossing of the threshold in
much the same way that the noise injected in the first leg affects its crossing of the
internal threshold. This means that, like the first leg, the time associated with the
second leg (a random time) also possesses a mean (denoted as $\bar{t}_2$), a barrier
(denoted as $b_2$), and a timing jitter (denoted as $\Delta t_2$).

Because of the Markov property, the influence of the first leg on the second leg is
captured by what happens only at $V_{out} = V_t$, and is otherwise independent of
what has happened earlier in the first leg. What happens at $V_{out} = V_t$ is
captured by $\Delta t_1$. Because $\Delta t_1$ and $\Delta t_2$ are independent, $\Delta t_1$ can be added to $\Delta t_2$
(in the r.m.s. sense) to obtain $\Delta t_d$. Thus:

$$\Delta t_d = \sqrt{\Delta t_1^2 + \Delta t_2^2} \qquad (7.122)$$

For completeness we also have:

$$\bar{t}_d = \bar{t}_1 + \bar{t}_2 \qquad (7.123)$$
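
The r.m.s. addition in (7.122) can be checked with a small Monte Carlo experiment. This is a sketch only: it assumes each leg's timing error is zero-mean Gaussian and independent of the other, and the jitter values are hypothetical numbers in arbitrary time units.

```python
import math
import random

def simulate_total_jitter(sigma1, sigma2, n_trials=100_000, seed=0):
    """Sample standard deviation of the sum of two independent leg errors."""
    rng = random.Random(seed)
    totals = [rng.gauss(0.0, sigma1) + rng.gauss(0.0, sigma2)
              for _ in range(n_trials)]
    mean = sum(totals) / n_trials
    var = sum((x - mean) ** 2 for x in totals) / (n_trials - 1)
    return math.sqrt(var)

sigma1, sigma2 = 3.0, 4.0                          # hypothetical leg jitters
predicted = math.sqrt(sigma1 ** 2 + sigma2 ** 2)   # r.m.s. sum as in (7.122)
simulated = simulate_total_jitter(sigma1, sigma2)
print(predicted, simulated)
```

The simulated spread should land close to the r.m.s. sum, which is what independence of the two legs implies.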

Now referring to Figure 7.56c, within each of leg 1 and leg 2, $M_1$ stays in only
one region. Thus the slew rate and $S_{in}$ are constant. Also, in the present situation it is
assumed that not only does $V_{out}$ cross the threshold only one time, it also crosses
the internal threshold (sat/triode) only one time. Thus assumptions (1)-(3) are
satisfied and so we can easily find each of $\Delta t_1$ and $\Delta t_2$. Thus:

$$\Delta t_1 = \frac{1}{\sqrt{2}} \frac{\sqrt{\bar{t}_1 S_{in1}}}{SR_1} \quad \text{and} \quad \Delta t_2 = \frac{1}{\sqrt{2}} \frac{\sqrt{\bar{t}_2 S_{in2}}}{SR_2} \qquad (7.124)$$

Substituting (7.124) in (7.122):

$$\Delta t_d = \frac{1}{\sqrt{2}} \sqrt{\frac{\bar{t}_1 S_{in1}}{SR_1^2} + \frac{\bar{t}_2 S_{in2}}{SR_2^2}} \qquad (7.125)$$

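From Figure 7.56c, $SR_1 = I_{sat}/C$ and $SR_2 = I_{triode}/C$. Since $I_{sat} > I_{triode}$,
$SR_1 > SR_2$.

As before, we have:

$$\bar{t}_1 = \frac{b_1}{SR_1}, \quad \bar{t}_2 = \frac{b_2}{SR_2} \qquad (7.126)$$

$b_1 = V_t$ and $b_2 = V_{dd}/2 - V_t$.
Finally we have:

$$b = b_1 + b_2 = V_{swing}/2 \qquad (7.127)$$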
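
As a numerical illustration of the barriers and mean times, the sketch below divides each barrier by the slew rate of its region and adds the two legs. It assumes the slew rates are the region currents divided by the load capacitance; the function name and all component values are hypothetical.

```python
def two_leg_delay(v_dd, v_t, i_sat, i_triode, c):
    """Barriers and mean times for the two legs of the transition."""
    b1 = v_t                 # barrier of the first leg (0 to V_t)
    b2 = v_dd / 2 - v_t      # barrier of the second leg (V_t to V_dd/2)
    sr1 = i_sat / c          # slew rate in saturation
    sr2 = i_triode / c       # time-averaged slew rate in triode
    t1 = b1 / sr1            # mean time of the first leg
    t2 = b2 / sr2            # mean time of the second leg
    return {"b": b1 + b2, "t1": t1, "t2": t2, "td": t1 + t2}

# Hypothetical example values (not from the source).
result = two_leg_delay(v_dd=1.8, v_t=0.4, i_sat=100e-6, i_triode=50e-6, c=100e-15)
print(result)
```

For these values the second leg takes longer than the first even though its barrier is only slightly larger, because the triode slew rate is half the saturation slew rate.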

The  final  equation  is: 

2 

To  J    foffse 

Atd  =  y/Ati  +  Ah-  P=0-25,  b^V^-^d-V,),  b2=Vdd-Vt 


,         N                 /     fV2N  x  Atd\       1     \ 
psd(foffset)  =  10  log  [fo  [?—= — —  (7.128) 


If  in  addition  to  assumption  3,  assumption  2  is  also  addressed,  (7. 124)  can  be 
further  refined  by  using  time  scaling,  and  we  have: 

Atl=(- ti  y^i]x(2J,+i)feH 

\SR-constJ  y  '         ' 
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Afe  = 


(7.130) 


1       h(h)  1       OiAh) 

SR-Consti  =  ==r  x  — — — ,  Noise-consti  ==pX  — "— — 


<i 


°7„,0i)  =  ySiJh)  =  \4-kT(ytriode(h)gds_oMi  +— j 


J-<Vons/2  —  =2?  x  ~~ 7= —      5      ^oiseco„st2  —  =p  X  p; 


(7.131) 


<x/„2 (fe)  -  V5/„2 (h)  =  \l*kT[  y8mui  (h)  +  —  )  (7.132) 


7.9.2     Unsaturated  Ring  Oscillators 

To  reduce  hardware  and/or  increase  oscillation  frequency,  the  number  of  stages,  N, 
in  a  ring  oscillator  can  be  reduced.  As  N  decreases,  the  oscillator  outputs  eventually 
become unsaturated. In this section we assume $N = 2$. In the past, ring oscillators
with unsaturated outputs have been reported to have poorer phase noise
characteristics than ring oscillators with saturated outputs [3] (all other things
being equal). This is due to cycle-to-cycle correlation. Thus an unsaturated ring
oscillator has a higher oscillation frequency but also higher phase noise.

To look at the phase noise of an unsaturated ring oscillator, we employ all the
methods that handle assumptions 1-3 in 7.9.1, which apply to the saturated ring
oscillator. In addition, we incorporate the effect of cycle-to-cycle correlation,
which appears only in the unsaturated ring oscillator. Similar to past phase noise
models of unsaturated ring oscillators, we focus our analysis on thermal noise, as a
first attempt to investigate the impact of cycle-to-cycle correlation. The 1/f noise can
likewise be treated, as presented in [25]. For illustration, the design whose delay cell
is shown in Figure 7.59 is adopted. Transistors $M_3$, $M_4$ are biased in the triode
region by the bias voltage $P_{bias}$. They are always on (never turn off) since $V_{out+}$ and
$V_{out-}$ never reach $V_{dd}$. Transistors $M_5$, $M_6$ are configured as a cross-coupled pair.
$M_3$-$M_6$ together form the load. An explicit capacitor is used at the output. This,
together with any parasitic capacitance coming from the transistors' device capacitance,
is lumped together and denoted as $C$. The use of an explicit capacitor as an
additional means to control delay was reported, for example, in [4] (see fig. 7 of [4]).
Transistors $M_1$, $M_2$ are alternately fully switched on and off, and $C$ is charged and
discharged.

Fig. 7.59    Delay cell of the unsaturated ring oscillator

This delay cell is similar to some existing designs (e.g., [24]) that are gaining in
popularity. Specifically, compared with the more conventional differential ring
oscillators ([12], [27], [28]), this delay cell and these existing designs all have an
additional cross-coupled pair ($M_5$, $M_6$). This cross-coupled pair is supposed to
speed up the slew rate and thus reduce the timing jitter. In some of these existing
designs (see fig. 5c of [24]) there is no explicit capacitor at the output to control the
delay. Instead, the parasitic capacitances hanging at the output nodes are used to
control the delay. This existing design also has an extra pair of diode-connected PMOS
transistors (one of which was denoted in fig. 5c of [24] as M2) that are connected in
parallel to $M_3$, $M_4$ and which form part of the load.

With $N = 2$, we denote the outputs of the 1st and 2nd stages as $V_1$ and $V_2$, respectively.
Next let us concentrate on the 1st stage. Since we are using the delay cell in Figure
7.59, $V_{out}$ is now $V_1$. Because this is a two-stage fully differential ring oscillator,
from Figure 7.41, the output of the 2nd stage, $V_2$, is flipped and fed back to become the
input of the 1st stage. Hence $V_{in}$ is now $-V_2$. We will now look at the typical voltages.

First let us take a look at the waveforms of the delay cell. Some typical
waveforms of the differential outputs are shown in Figure 7.60a, with noise
exaggerated again for illustration purposes. From Figure 7.60a we can see that $V_1$, $V_2$
are identical except shifted in the time axis by a quarter of the period. It is seen that
$V_1$ starts at 0 and its mean ramps up. Meanwhile $V_2$ starts at $A_0$ at $t = 0$ and
independently its mean ramps down, with $V_2$ eventually crossing 0 at the quarter
period $\tau_1$. At the same time $V_1$ reaches $A_1$. Its mean then turns around and starts to
ramp down. Hence $V_1$ is never given a chance to saturate before it starts to ramp
down. Notice that the amplitude of the $n$th quarter period is denoted as $A_n$. It is defined as
the magnitude of the non-zero-crossing voltage process at the end of the $n$th quarter
period (the zero-crossing voltage process has the deterministic value zero at the end
of the $n$th quarter period). For example, we have just stated that at $\tau_1$, $V_2$ crosses zero, so it
is the zero-crossing voltage process for the 1st quarter period, $\tau_1$. The non-zero-crossing
voltage process for $\tau_1$ is then $V_1$, and we have just stated that its magnitude
at the end of $\tau_1$ is $A_1$. Hence the amplitude of $\tau_1$ is $A_1 = |V_1(\tau_1)|$. Similarly, $A_2$ is the
amplitude of $\tau_2$ and $A_2 = |V_2(\tau_2)|$, $A_3 = |V_1(\tau_3)|$, etc.

Fig. 7.60a    Waveforms of the differential outputs $V_1$, $V_2$. In time, there are 3 scales: $d$ (durations),
$\tau$ (quarter period) and $T$ (period). In amplitude there are $A$, $A_{mean}$, $\delta$ (noise at the end of the quarter
period, which equals $A - A_{mean}$), $V_{bk}$ (threshold), and $b$ (barrier). $E(b \mid \tau_1(\delta_0))$ is the conditional
expected barrier, conditioned on $\tau_1(\delta_0)$

Now as $V_1$, $V_2$ are differential output voltages, we show in Figure 7.60b some of
the waveforms of the single-ended output voltages. For example, waveforms for the
two single-ended output voltages corresponding to $V_1 = V_{out}$, denoted as $V_{out+}$ and
$V_{out-}$, are shown. Also shown is $V_{in}$. Since $V_{in}$, $V_{out}$ are identical except shifted
in the time axis by a quarter of the period, $V_{in}$ and $V_{out+}$, $V_{out-}$ are also shifted in
the time axis by a quarter of a period. Let us first look at $V_{out+}$, which is seen to
be centered at $V_{cm}$, the common-mode voltage. It swings symmetrically to its amplitude,
above and below $V_{cm}$, but never reaches $V_{dd}$. Essentially $V_{out+}$ ramps up toward
$V_{dd}$. However, before $V_{out+}$ reaches $V_{dd}$, $V_{in}$ crosses zero and then goes above zero,
turning off $M_2$ and turning on $M_1$, and $C$ starts to get discharged. Hence $V_{out+}$ turns
around and starts to ramp down, i.e., it is never given a chance to reach $V_{dd}$ before it
starts to ramp down. Hence the ring oscillator has an unsaturated output. Finally, let
us look at the waveform $V_{in}$. Following [4], we assume that the switching threshold
of the differential pair $M_1$-$M_2$ is zero. Hence, referring to Figure 7.59, this means that
during time $\tau_1$ (see Figure 7.60b), when $V_{in}$ is below zero, $M_1$ is off and $M_2$ is on. On
the other hand, during time $\tau_2$, $V_{in}$ is above zero and so $M_1$ is on and $M_2$ is off. This
means $M_1$, $M_2$ are fully switched. Meanwhile, at all times $V_{out+}$, $V_{out-}$ never reach
$V_{dd}$. This means $M_3$, $M_4$ are always on. Therefore, according to Figure 7.55, this is an
example of an unsaturated oscillator.

Fig. 7.60b    Typical voltages of a two-stage unsaturated ring oscillator using the delay cell in Figure
7.59. Noise is exaggerated and superimposed on the mean
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Let  us  use  the  model  to  explain  qualitatively  where  the  cycle  to  cycle  correlation 
arises from. First, referring to Figure 7.60a, $\delta_0$ is the randomness in $V_2$ at $t = 0$. This,
together with the noise injected in stage 2 during $\tau_1$, is the reason why $\tau_1$, the time it
takes $V_2$ to ramp down and cross the threshold at zero, is random. This randomness
is the timing jitter of $\tau_1$. Meanwhile $V_1$ (a random process with noise injected in
stage 1 during $\tau_1$) starts from zero at $t = 0$ and ramps up. As $V_2$ crosses zero at $t = \tau_1$,
sign($V_2$) changes and hence the slew rate changes sign, so $V_1$ finishes ramping up,
turns around and starts to ramp down. Hence $V_1(\tau_1)$ is the voltage that $V_1$ assumes
at the moment when the drift of $V_1$ turns around. $V_1(\tau_1)$ is random and it differs
from the amplitude mean, $A_{mean}$, by $\delta_1$, i.e., $\delta_1 = A_1 - A_{mean} = |V_1(\tau_1)| - A_{mean}$. Thus
physically $\delta_1$ is the randomness in $V_1$ at the end of $\tau_1$ and it involves two
contributions:

a) during $\tau_1$, $V_1$, a random process, has noise continuously injected into it

b) the random process also runs for a random time $\tau_1$

Because of (b), $\delta_1$ is a function of $\tau_1$ and hence of $\delta_0$.

Next let us look at $V_1$ during $\tau_2$, when $V_1$ ramps down and crosses the threshold
at zero at time $\tau_1 + \tau_2$. Just like the case of $V_2$ previously, now $V_1$ also starts
with randomness, this time $\delta_1$. Again this, together with the noise injected in stage 1
during $\tau_2$, is the reason why $\tau_2$, the time it takes $V_1$ to ramp down and cross the
threshold at zero, is random. Again $\tau_2$ depends on the initial randomness $\delta_1$.
However, since we have already established that $\delta_1$ depends on $\tau_1$, $\tau_2$ in
turn depends on $\tau_1$, and so there is correlation between these two consecutive quarter
periods.
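
The chain of dependence described above (each quarter period inherits part of the randomness of the previous one) can be imitated with a toy first-order autoregressive model. This is not the model of [25]: the coupling factor `k` and the noise level `sigma_e` are hypothetical parameters introduced only to show how such inheritance produces correlation between consecutive quarter periods.

```python
import random

def simulate_quarter_periods(k, sigma_e, n=100_000, seed=1):
    """Toy model of quarter-period deviations x_n = tau_n - mean(tau).

    Each deviation keeps a fraction k of the previous one and adds
    fresh, independent Gaussian noise.
    """
    rng = random.Random(seed)
    x = 0.0
    out = []
    for _ in range(n):
        x = k * x + rng.gauss(0.0, sigma_e)
        out.append(x)
    return out

def lag1_correlation(x):
    """Sample correlation between consecutive entries of x."""
    n = len(x)
    mean = sum(x) / n
    num = sum((x[i] - mean) * (x[i + 1] - mean) for i in range(n - 1))
    den = sum((v - mean) ** 2 for v in x)
    return num / den

deviations = simulate_quarter_periods(k=0.5, sigma_e=1.0)
rho_hat = lag1_correlation(deviations)
print(rho_hat)
```

In this toy model the correlation between consecutive deviations approaches `k`, so a load that passes less of one quarter period's randomness to the next corresponds to a smaller `k`.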

The final equation of the phase noise, incorporating cycle-to-cycle correlation
and all the methods that handle assumptions 1-3 in 7.9.1, can now be derived. It is,
however, quite complicated. To highlight the effect of cycle-to-cycle correlation,
we now present the phase noise equation with this effect alone. Following [25],
this is:


psd((ooifset)  =    10  log  (  —    I     v;P  '  I  -}—    dBc/Hz  (7.133) 


Fig. 7.61    I-V characteristics of load, with $V_{in}$ set to be negative; impedance is negative of slope

Here the correlation coefficient summarizes the amount of cycle-to-cycle
correlation, and lies between 0 and 1. As it increases, phase noise increases. It can
be analytically related to the circuit parameters, such as the amplitude of oscillation,
$A_{mean}$, slew rate, characteristics of the load, etc. [25]. $\sigma_\varepsilon$ is the noise source and
is analytically related to the circuit parameters, such as the tail current value,
$I_{tail\_mean}$, etc. [25]. For thermal noise, $\sigma_\varepsilon$ decreases as $I_{tail\_mean}$ decreases, since
thermal noise (current) depends on $g_m$, the transconductance of the devices, which
decreases with decreasing $I_{tail\_mean}$. [25] shows that one way to improve phase noise is
to improve the characteristics of the load so that correlation is reduced. Shown in
Figure 7.61 is an example $I$ versus $V$ characteristic across the load, used to characterize
the load impedance. The slope at the center, $s_c$, is positive, because the
negative impedance presented by the cross-coupled pair is designed to outweigh
the positive impedance presented by the triode-based load, giving an
overall negative impedance (hence positive slope). The slope at the side, $s_s$, is
negative, because at those voltages the cross-coupled pair is turned off, leaving
only the positive impedance presented by the triode-based load (hence negative
slope). An I-V characteristic of this form (i.e., with the center portion of the
slope positive) is not unexpected for a 2-stage ring oscillator, as a stable
limit cycle can be achieved. To improve phase noise we can increase $|s_s|$, $|s_c|$
(defined in Figure 7.61), while keeping $I_{tail\_mean}$ constant. Heuristically, by doing
this, the resulting waveform is less "linear", and thus a perturbation of the amplitude
in stage 1's output voltage, due to noise, causes less perturbation in
the time at which it crosses the threshold. Stage 1's output voltage,
being the same as stage 2's input voltage, triggers stage 2's output to start ramping
up/down upon this crossing of the threshold. With less perturbation in the
crossing time, the amplitude uncertainty of stage 2's output is also reduced. Thus
a perturbation in the amplitude of stage 1's output has less effect on the amplitude of
stage 2's output and subsequently on stage 1's output in the subsequent cycle (hence
less cycle-to-cycle correlation). It should be remembered, though, that this propagation
of amplitude perturbation from one cycle to the next comes not just from the
noise injected at the peak of stage 1's output, but also from noise injected into the
output voltage during the ramp (up/down). Thus all the noise should be included
in computing the correlation. This way of increasing $|s_s|$, $|s_c|$, while keeping
Itaii_mean  constant,  can  be  done  by  decreasing  Pbias*    *   >  and   t-1,  as  well  as 


increasing   j1 , 


£,  and   Hi 

3  4 

and   ^4.  Since  Itaii  mean,  N,  Vdd  remain  constant,  this  means 


llii 
U 


power remains constant.


7.10    Problems


7.1  A  new  PFD  implemented  with  RS-FF  is  shown. 


[Figure: PFD built from RS flip-flops; inputs R and V, outputs U and D, internal RESET signal]

a)  Given  the  following  R,  V  waveforms  (they  have  the  same  frequency  but 
different  phases),  show  the  U,  D,  reset  waveforms  and  explain  how  they 
perform  phase  detection. 


[Figure: R and V waveforms with the same frequency and different phases]


b)  The  R-V  waveforms  are  changed  and  are  as  follows,  where  they  have 
different  frequencies. 


[Figure: R and V waveforms with different frequencies]

(i)  Show  the  U,  D,  reset  waveforms 
(ii)  Let  us  define  a  =  % ,  B  =  %^  = 

/R      r  TV 


1.7 


.&; 


A 


^and  (U-D)a 


as  the 
time  average  of  the  different  outputs.  Show  that  (U  —  D)ave  =  tt^q  ■ 
(Hint:  Start  by  observing  that  the  probability  of  a  single  V  transition  in 
[t,  t+TR]  (=a)-  Here  TR  is  the  time  interval  such  that  R  has  two 
successive  transitions  at  time  t  and  t+TR.  Also  assume  the 
corresponding   probability   density   function   within  that  interval  is 
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uniformly  distributed).  Plot  (U  —  D)ave  versus  P  and  explain  how  it 
achieves  frequency  detection. 

7.2  Shown  in  Figure  P7.1  is  a  new  type  of  phase  detector  (the  D-FF  are  positive 
edge  triggered).  The  phase  detector  output  is  being  fed  into  an  integrator  (used 
as  the  loop  filter  in  a  PLL). 


Fig. P7.1    [phase detector schematic; recoverable labels: delayed data, phase detector output]

a)  When  clock  and  data  are  aligned  as  shown  in  Fig.  P7.2,  plot  the  missing 
waveforms  in  the  diagram. 


Fig. P7.2    [waveform template; traces: Delayed Data, Clock, Q(U3), Q(U4), Output U1, Output U2, Phase detector output, Loop integrator output]


b) When the data is ahead of clock as shown in Fig. P7.3, plot the missing
waveforms in the diagram.

Fig. P7.3    [waveform template; traces: Delayed Data, Clock, Q(U3), Q(U4), Output U1, Output U2, Phase detector output, Loop integrator output]


c) Plot the transfer characteristics of this phase detector for phase error varying
from $-2\pi$ to $2\pi$. You can assume maximum data transition density.

7.3  Let  us  design  another  phase  detector  (still  based  on  D-FF).  The  resulting 
waveform  is  shown  in  Fig.  P7.4. 

a) Design the phase detector that outputs this phase detector output waveform
(and hence the corresponding loop integrator output waveform).

Fig. P7.4    [waveforms; traces: Delayed Data, Clock, Phase detector output (levels 0, 1, 2), Loop integrator output; time axis marked at T/2]


b)  Plot  the  waveforms  at  the  internal  nodes  of  this  phase  detector  (you  should 
include  at  least  output  of  all  the  D-FF). 

7.4  Figure P7.5 shows a divider circuit. The input is labelled clk, and the output is
labelled Out. There is a control input labelled MC. Assume that clk is a 50% duty-cycle
square wave clocking at 1 GHz.

a) When MC = 0, draw the waveforms along the signal path [i.e., clk(L1), Q(L1),
D(L2), Q(L2), clk(FF1), Q(FF1), Out, MC]. The waveforms should be long
enough to show the divider action. What is the total division ratio?

b) Repeat the same procedure when MC = 1. Now what is the total division
ratio?

Fig. P7.5


Fig. P7.6

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7.5  Using  a  BJT  biased  at  Iq  —  1  mA,  design  a  Colpitts  oscillator  as  shown  in 
Fig.  P7.6  to  operate  at  ff>o  =  1.9  x  Mrrad/s.  Use  Ci=5.26  pF  and  assume  that 
the $C_1$ available has a $Q$ of 100 (this can be represented by a resistance in
parallel with $C_1$ whose value is given by $Q/(\omega_0 C_1)$). Also assume that $R_L = 2\ \mathrm{k\Omega}$
and that for the BJT, $r_o = 100\ \mathrm{k\Omega}$. Find $C_2$ and $L$.

7.6  Figure  P7.7  is  a  LC  based  oscillator.  The  block-level  diagram  is  given  in  Figure 
P7.7a  and  the  transistor-level  diagram  is  given  in  Figure  P7.7b. 

a)  Explain  the  operation  of  the  circuit  in  Figure  P7.7b  by  explaining  how  it 
oscillates.  In  the  circuit  in  Figure  P7.7b  there  is  a  positive  feedback  path  and 
a  negative  feedback  path.  Identify  the  paths. 

b)  Now  we  want  to  tune  the  circuit  using  a  varactor  diode.  Show  how  we  can 
modify  the  circuit  in  Figure  P7.7b  to  achieve  that.  Show  the  complete 
modified  circuit.  Derive  the  Kvco  for  this  modified  circuit. 

7.7  In  this  question  we  will  investigate  the  phase  noise's  PSD  of  the  LC  oscillator 
as  described  in  Figure  P7.6.  Since  this  oscillator  is  a  resonance  based  oscillator, 
its  phase  noise  PSD  will  be  derived  by  the  simplifying  assumption  that  the 
oscillator  is  an  LTI  system  (a  gross  approximation). 


Fig. P7.7    LC-based oscillator: (a) block-level diagram; (b) transistor-level diagram


To  derive  the  phase  noise's  PSD  of  this  Colpitts  oscillator,  it  is  represented  in 
the  block  diagram  in  Figure  P7.8. 

Using  this  block  diagram,  one  can  express  noise  at  the  output  as  a  product  of 
noise  injected  at  the  summing  node  and  the  corresponding  transfer  function. 
The  detail  has  been  worked  out  at  [9]  with  the  results  given  below: 


A/ 

v|05) 


(ft)' 


d{af) 


(P7.1) 


Here

$\overline{v_o^2}(f_i)/\Delta f$ = PSD of noise at the output ($S_o$) of the VCO, at an offset frequency
$f_i$ from the center frequency $f_0$.

$\overline{v_i^2}(f_i)/\Delta f$ = PSD of noise injected at the summing node of the VCO (just before the
$a(s)$ block), at an offset frequency $f_i$ from the center frequency $f_0$.

$a$ = forward gain
$f$ = feedback factor

Fig. P7.8    [block diagram: input $S_i(s)$, summing node, forward block $a(s)$, output $S_o(s)$, feedback block $f(s)$ with feedback signal $S_{fb}(s)$]

Fig. P7.9    [schematic of two cascaded delay cells; recoverable labels: $V_{in}$, $V_{tune}$, $V_{tune}'$]

a) Derive expressions for $a(s)$ and $a(s)f(s)$ in Figure P7.6 in terms of the circuit
elements $R_L$, $L$, $C_1$, $C_2$, and $g_m$ of $Q_1$. (Hint: a good starting point is the transfer
function described in (7.47), derived for the transformer-coupled LC oscillator.)
Then substitute these into (P7.1) and find an expression for the PSD of noise at the
output (you can assume the only noise source is lumped as an equivalent input noise
source placed at the base of transistor $Q_1$).
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b)  Assume  the  noise  injected  at  the  summing  node  of  the  oscillator,  which  is 
represented  by  an  equivalent  voltage  noise  source  v\,  has  an  equivalent 
noise  resistance  =  j—  where  gm  is  the  transconductance  of  Ql.  Write  down 

the  expression  for  ^jM  and  hence  derive  the  expression  for  ^4^ . 

c)  Derive  an  expression  for  v*arrier,  the  mean  square  voltage  of  the  oscillator 
output  voltage  when  no  noise  is  injected.  This  is  also  called  the  carrier 
signal.  (Hint:  a  good  reference  is  [24]).  Using  the  same  values  from  problem 
7.5,  calculate  the  numerical  value  of  v2carrier.  Then  use  this  value  and  the 

expression  of  ^W-  derived  in  (b)  to  find  an  expression  for  Se_vco(fi). 

7.8  Fig.  P7.9  shows  the  circuit  schematic  of  a  non-saturated  delay  cell.  Two 
identical  cells  are  shown,  one  driving  the  other.  Assume  that  a  method  is 
devised  such  that  the  cell  can  switch  fast  so  that  its  switching  waveform 
resembles  that  of  a  fast-slewing  saturated  delay  cell.  Derive  an  expression  for 
the  Kvco  of  the  ring  oscillator  built  using  these  non-saturated  delay  cells.  The 
expression  should  be  in  terms  of  gm,  C^,  C^  and  other  device  parameters  of 
specific transistors, $N$ (number of stages), and $V_{swing}$. You can assume
$V_{dd} - V_{tune}'$ equals $V_{tune}$.
Chapter 8

Frequency  Synthesizer:  Loop  Filter 

and  System  Design 


8.1     Introduction 

In Chapter 7 we discussed the phase/frequency processing elements of a
frequency synthesizer. We noted that noise and mismatch in these components
generate spurs and phase noise. In this chapter we investigate how spurs and
phase  noise  from  individual  components  affect  the  spurs  and  phase  noise  of  the 
complete  synthesizer.  To  discuss  these  issues,  we  need  to  know  the  transfer 
functions  from  these  components  to  the  synthesizer  output.  These  transfer  functions 
depend  on  the  loop  filter  of  the  synthesizer.  Hence  we  start  by  discussing  the  loop 
filter.  We  analyze  the  loop  filter  from  the  view  point  of  using  it  as  a  key  to  trade  off 
spurs  and  phase  noise  while  maintaining  stability.  Then  we  provide  a  design  flow 
chart  of  the  synthesizer  and  a  detailed  design  example  of  a  synthesizer  that  is  used 
in  a  DECT  receiver  front  end. 


8.2     Loop  Filter:  General  Description 

In this section we discuss the loop filter, the single most important component in the
synthesizer for trading off performance measures such as spurs, phase noise,
capture range, and lock range.


8.2.1     Basic  Equations  and  Definitions 

We  start  by  introducing  the  proper  definition  of  the  following  terms  [5]:  filter 
transfer  function  F(s),  open-loop  transfer  function  G(s),  and  closed-loop  transfer 
function  H(s)  of  the  synthesizer.  They  will  be  used  again  later  in  the  design  of  the 
complete  synthesizer. 

B. Leung, VLSI for Wireless Communication, DOI 10.1007/978-1-4614-0986-1_8
Fig. 8.1 Block diagram of synthesizer. The PD/charge pump produces $i_{pd} = K_{pd}\theta_e = K_{pd}(\theta_r - \theta_o/M)$, the loop filter $F(s)$ produces $v_{lf}$, the VCO obeys $d\theta_o/dt = K_{vco}v_{lf}$, and the divider $\div M$ closes the loop.

In  Figure  8.1  we  redraw  the  block  diagram  of  the  PLL  in  Figure  7.2a.  Note  that 
we  have  reinserted  the  divider  M.  As  stated  in  sub-section  7.3.3  (Chapter  7)  the  PD/ 
charge  pump  is  now  represented  by  an  LTI  circuit  with  gain  Kpd.  Also  we  stated  in 
sub-section  7.3.1  (Chapter  7)  that  we  assume  that  no  cycle  slipping  occurs.  Hence, 
the  whole  synthesizer  is  an  LTI  system  and  can  be  described  using  the  familiar 
Laplace  transform. 

We define everything in phase, use the s-domain representation, and generalize the
filter from a simple capacitor to a general network with transfer function $F(s)$.
Hence we generalize (7.1), (7.4a), (7.4c) to the following equations:

$$\theta_o(s) = \frac{K_{vco}\,v_{lf}(s)}{s} \tag{8.1}$$

$$i_{pd}(s) = K_{pd}\,\theta_e(s) = K_{pd}\left[\theta_r(s) - \frac{\theta_o(s)}{M}\right] \tag{8.2}$$

$$v_{lf}(s) = F(s)\,i_{pd}(s) \tag{8.3}$$

From  the  preceding  equations  we  can  derive  the  transfer  function  between  any  two 
variables.  The  most  important  one  is  the  closed-loop  input/output  transfer  function: 


$$H(s) = \frac{\theta_o(s)}{\theta_r(s)} = \frac{K_{vco}K_{pd}F(s)}{s + \dfrac{K_{vco}K_{pd}F(s)}{M}} \tag{8.4}$$


Another transfer function of interest is the error transfer function, which can be
expressed in terms of $H(s)$:

$$\frac{\theta_e(s)}{\theta_r(s)} = \frac{i_{pd}(s)}{K_{pd}\,\theta_r(s)} = 1 - \frac{H(s)}{M} \tag{8.5}$$

The open-loop transfer function $G(s)$ is simply defined as

$$G(s) = \frac{K_{pd}K_{vco}}{sM}\,F(s) \tag{8.6}$$


As can be seen, all the transfer functions depend on the loop filter $F(s)$, and so a
review of the basics of filter design is in order.
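The relationships among (8.4), (8.5), and (8.6) can be checked numerically. The following is a minimal sketch, not a design tool: the gain values, the divider ratio, and the capacitor value are hypothetical numbers chosen only for illustration, and the function names are not from the text. It evaluates the three transfer functions for the single-capacitor filter and confirms that $H(s) = MG(s)/(1+G(s))$ and that the error transfer function equals $1/(1+G(s))$.

```python
import math

def open_loop_gain(s, F, K_pd, K_vco, M):
    """Open-loop transfer function G(s) = K_pd*K_vco*F(s)/(s*M), as in (8.6)."""
    return K_pd * K_vco * F(s) / (s * M)

def closed_loop_gain(s, F, K_pd, K_vco, M):
    """Closed-loop transfer function H(s) = theta_o/theta_r, as in (8.4)."""
    forward = K_vco * K_pd * F(s)
    return forward / (s + forward / M)

def error_transfer(s, F, K_pd, K_vco, M):
    """Error transfer function theta_e/theta_r = 1 - H(s)/M, as in (8.5)."""
    return 1 - closed_loop_gain(s, F, K_pd, K_vco, M) / M

# Hypothetical values, chosen only for illustration.
K_PD = 1e-4    # phase detector / charge pump gain, A/rad
K_VCO = 2e8    # VCO gain, rad/s/V
M = 100        # divider ratio
C = 1e-9       # single-capacitor loop filter, F

def capacitor_filter(s):
    """Current-driven capacitor: F(s) = 1/(sC)."""
    return 1 / (s * C)

s = 1j * 2 * math.pi * 1e4   # evaluate at a 10 kHz offset
G = open_loop_gain(s, capacitor_filter, K_PD, K_VCO, M)
H = closed_loop_gain(s, capacitor_filter, K_PD, K_VCO, M)
E = error_transfer(s, capacitor_filter, K_PD, K_VCO, M)

# Both checks follow algebraically from (8.4)-(8.6).
print(abs(H - M * G / (1 + G)) < 1e-9 * abs(H))
print(abs(E - 1 / (1 + G)) < 1e-9)
```

Passing the filter as a function keeps the three definitions independent of any particular $F(s)$, which mirrors how the text generalizes from a capacitor to an arbitrary network.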


Fig. 8.2 First-order loop filter


8.2.2     First-Order  Filter 

The simplest loop filter consists of a single capacitor C (driven by a current source
rather than a voltage source), as shown in Figure 7.4. Such a simple first-order filter
(which is simply an integrator) offers very limited flexibility in optimizing the
performance, as will be shown later. As a result there is a need for more complex
filters. A general first-order filter is shown in Figure 8.2.
For this first-order filter we have


$$F_1(s) = \frac{R_1(sR_2C+1)}{1+s(R_2+R_1)C} \tag{8.7}$$


Figure 8.3a and Figure 8.3b show the frequency responses of $F_1(s)$ and $G_1(s)$.
Substituting (8.7) into (8.6), the loop transfer function becomes

$$G_1(s) = \frac{K_{pd}K_{vco}}{Ms}\,F_1(s) \propto \frac{K_{pd}K_{vco}(s+z_1)}{Ms(s+p_2)} \tag{8.8}$$


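Notice from Figure 8.3b that the unity gain frequency of the loop transfer
function is $\omega_u$.

Now (8.8) can be rewritten as

$$G_1(s) = \frac{K_{vco}K_{pd}}{Ms}\,\frac{R_1(1+s\tau_2)}{1+s\tau_1} = \frac{K_{pd}K_{vco}R_1}{M}\,\frac{1}{s}\,\frac{s+\frac{1}{\tau_2}}{s+\frac{1}{\tau_1}}\,\frac{\tau_2}{\tau_1} = \frac{K_{pd}K_{vco}}{sM}\,(R_1\|R_2)\,\frac{s+z_1}{s+p_2} \tag{8.9a}$$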


Here

$$z_1 = \frac{1}{\tau_2} = \frac{1}{R_2C} \tag{8.9b}$$

$$p_2 = \frac{1}{\tau_1} = \frac{1}{(R_1+R_2)C} \tag{8.9c}$$

Fig. 8.3 Frequency responses of first-order loop filter and resulting open-loop transfer function


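How about $H_1(s)$? Substituting (8.9b), (8.9c) into (8.7) and then the resulting
$F_1(s)$ into (8.4), we have

$$H_1(s) = \frac{K_{vco}K_{pd}R_1(s\tau_2+1)}{s^2\tau_1 + s\left(1+\dfrac{K_{vco}K_{pd}R_1\tau_2}{M}\right) + \dfrac{K_{vco}K_{pd}R_1}{M}} \tag{8.10}$$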


The frequency response of $H_1(s)$ is shown in Figure 8.4, where $\omega_n$ is the natural
frequency of the frequency response and $\zeta$ is the damping factor.

Fig. 8.4 Frequency response of closed-loop transfer function with a first-order loop filter
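As a hedged illustration of where $\omega_n$ and $\zeta$ come from, the sketch below substitutes (8.7) into (8.4) and matches the resulting denominator to the standard second-order form $s^2 + 2\zeta\omega_n s + \omega_n^2$. Matching to that standard form is an assumption made here for illustration; the component values and function names are hypothetical and do not come from the text.

```python
import math

def first_order_loop(R1, R2, C, K_pd, K_vco, M):
    """Zero, pole, natural frequency and damping for the filter of (8.7)."""
    tau1 = (R1 + R2) * C          # pole time constant, as in (8.9c)
    tau2 = R2 * C                 # zero time constant, as in (8.9b)
    K = K_pd * K_vco
    # Denominator of H1(s): tau1*s^2 + (1 + K*R1*tau2/M)*s + K*R1/M
    omega_n = math.sqrt(K * R1 / (M * tau1))
    zeta = (1 + K * R1 * tau2 / M) / (2 * omega_n * tau1)
    return {"z1": 1 / tau2, "p2": 1 / tau1, "omega_n": omega_n, "zeta": zeta}

def H1_direct(s, R1, R2, C, K_pd, K_vco, M):
    """H1(s) obtained by inserting (8.7) directly into (8.4)."""
    F1 = R1 * (s * R2 * C + 1) / (1 + s * (R1 + R2) * C)
    forward = K_vco * K_pd * F1
    return forward / (s + forward / M)

def H1_standard(s, R1, R2, C, K_pd, K_vco, M):
    """The same H1(s) written with omega_n and zeta in the denominator."""
    p = first_order_loop(R1, R2, C, K_pd, K_vco, M)
    tau1 = (R1 + R2) * C
    numerator = K_pd * K_vco * R1 * (s * R2 * C + 1) / tau1
    denominator = s**2 + 2 * p["zeta"] * p["omega_n"] * s + p["omega_n"]**2
    return numerator / denominator

# Hypothetical component values, for illustration only.
params = dict(R1=10e3, R2=1e3, C=10e-9, K_pd=1e-4, K_vco=2e8, M=100)
loop = first_order_loop(**params)
s = 1j * 1e5
print(loop)
print(abs(H1_direct(s, **params) - H1_standard(s, **params)))
```

With these particular values the damping factor comes out near 0.7, but that is a property of the numbers chosen here, not a recommendation from the text.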

8.2.3     Second-Order  Filter 

A  second-order  filter,  whose  transfer  function  is  denoted  as  F2(s),  is  shown  in 
Figure  8.5.  This  is  one  of  the  most  popular  filters  [4]  for  frequency  synthesizer 
applications.  When  compared  with  the  first-order  filter  in  Figure  8.2,  the  present 
filter  has  one  fewer  resistor  but  one  more  capacitor.  The  elimination  of  the  resistor 
means  one  pole  is  moved  to  the  origin.  The  extra  capacitor  makes  the  filter  a 
second-order filter and the synthesizer a third-order loop. The extra capacitor $C_3$
is included only as a second-order refinement (mainly to smooth out voltage jumps
across $R_2$ caused by current switching), and the extra pole it introduces is far from
the main pole. Hence for most purposes (such as stability), the filter behaves like a
first-order filter and the synthesizer behaves like a second-order synthesizer with similar
transfer functions. The transfer function of this filter is


$$F_2(s) = \frac{s+z_1}{sC_3(s+p_3)} \tag{8.11a}$$

where

$$z_1 = \frac{1}{R_2C} \tag{8.11b}$$

and

$$p_3 = \frac{C+C_3}{R_2CC_3} \tag{8.11c}$$
Fig. 8.5 Second-order loop filter

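Alternatively, it can be written as

$$F_2(s) = \left(\frac{b-1}{b}\right)\frac{1+s\tau_2}{sC\left(\dfrac{s\tau_2}{b}+1\right)} \tag{8.12a}$$

where

$$b = 1+\frac{C}{C_3} \tag{8.12b}$$

and

$$\tau_2 = R_2C \tag{8.12c}$$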


Equation (8.12a) shows that $F_2(s)$ has the dimension of impedance ($\Omega$) because it takes a
current input $i_{pd}$ (from the charge pump) and generates a voltage output, $v_{lf}$. As was
shown in [4], typically $b$ is large and $F_2(s)$ becomes


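$$F_2(s) \approx \frac{1+s\tau_2}{sC} = \frac{R_2(s+z_1)}{s} \tag{8.13}$$

where $z_1 = 1/(R_2C)$.
Now again assuming that $b$ is large, $F_2(s)$ can also be written as

$$F_2(s) \approx \frac{b}{C}\,\frac{s+z_1}{s(s+p_3)} \tag{8.14}$$

Here

$$z_1 = \frac{1}{\tau_2} \tag{8.14a}$$

$$p_3 = \frac{b}{\tau_2} \tag{8.14b}$$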
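The different ways of writing $F_2(s)$ can be compared numerically. The sketch below assumes the usual topology for this filter, a series $R_2$-$C$ branch shunted by $C_3$ and driven by the charge pump current; that topology is an assumption here because the schematic is not reproduced. Component values and function names are hypothetical. It checks that the pole-zero form (8.11a) and the $b$-form (8.12a) agree with the circuit impedance, and shows how close the large-$b$ approximation (8.13) is at the zero frequency.

```python
def f2_circuit(s, R2, C, C3):
    """Impedance of C3 in parallel with the series branch R2 + 1/(sC) (assumed topology)."""
    z_series = R2 + 1 / (s * C)
    z_c3 = 1 / (s * C3)
    return z_series * z_c3 / (z_series + z_c3)

def f2_pole_zero(s, R2, C, C3):
    """Pole-zero form, as in (8.11a)."""
    z1 = 1 / (R2 * C)
    p3 = (C + C3) / (R2 * C * C3)
    return (s + z1) / (s * C3 * (s + p3))

def f2_b_form(s, R2, C, C3):
    """Form using b = 1 + C/C3 and tau2 = R2*C, as in (8.12a)."""
    b = 1 + C / C3
    tau2 = R2 * C
    return (b - 1) / b * (1 + s * tau2) / (s * C * (s * tau2 / b + 1))

def f2_large_b(s, R2, C):
    """Large-b approximation, as in (8.13): C3 is neglected."""
    return (1 + s * R2 * C) / (s * C)

# Hypothetical values with C/C3 = 100, so b = 101.
R2, C, C3 = 1e3, 10e-9, 0.1e-9
s = 1j / (R2 * C)          # evaluate at the zero frequency z1

exact = f2_circuit(s, R2, C, C3)
relative_error = abs(f2_large_b(s, R2, C) - exact) / abs(exact)
print(abs(f2_pole_zero(s, R2, C, C3) - exact) / abs(exact))
print(abs(f2_b_form(s, R2, C, C3) - exact) / abs(exact))
print(relative_error)
```

The approximation error grows as the evaluation frequency approaches $p_3$, which is consistent with the statement that the extra pole matters only far from the main pole.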
Fig. 8.6a Frequency response of second-order loop filter

Fig. 8.6b Frequency response of open-loop transfer function with a second-order loop filter
Substituting (8.12a), (8.12b) into (8.6), we get $G_2(s)$:

$$G_2(s) = \frac{K_{pd}K_{vco}}{sM}\,\frac{s+z_1}{sC_3(s+p_3)} \tag{8.15}$$

Alternatively, using the approximated form for $F_2(s)$ as given in (8.13) and
substituting that into (8.6), we get

$$G_2(s) \approx \frac{K_{pd}K_{vco}}{sM}\,\frac{R_2(s+z_1)}{s} \tag{8.16}$$


F2(s)  and  G2(s)  are  plotted  in  Figure  8.6a  and  Figure  8.6b  respectively. 
Finally,  substituting  (8.12a)  into  (8.4),  we  get  H2(s). 
8.2.4    High-Order Filters

We  have  given  examples  of  first  and  second-order  loop  filters.  Now  we  generalize 
to  a  loop  filter  of  arbitrary  order  [5]. 

Substituting (8.6) into (8.4), we have

$$H(s) = \frac{MG(s)}{1+G(s)} \tag{8.17}$$

For an $(n-1)$th-order loop filter, the transfer function should be

$$F(s) = F(\infty)\,\frac{(s-z_1)(s-z_2)\cdots(s-z_m)}{(s-p_2)(s-p_3)\cdots(s-p_n)} \tag{8.18}$$

In the denominator we start with $p_2$ rather than $p_1$ (the last pole becomes, of
course, $p_n$). We do this in anticipation of embedding $F(s)$ inside the synthesizer,
where $p_1$ [this is counting the poles of the synthesizer, not just of $F(s)$] comes
from the VCO. Of course, $p_1$ is at DC. $F(\infty)$ in the numerator is the normalizing
factor. If the filter is to be realizable, $m \le n-1$.

Assuming that all poles are simple, we can expand $F(s)$ in partial fractions,
substitute in (8.6), and rewrite the open-loop transfer function as

$$G(s) = \frac{K}{s}\left[a_1 + \sum_{i=2}^{n}\frac{a_i}{s-p_i}\right] \tag{8.19}$$

where $p_i$ are the poles, $a_i$ are the residues, and $K$ is the normalizing constant.
In sub-sections 7.3.4, 7.3.5 and 7.8.3, we stated that to calculate the output spur and
phase noise due to the PD and the output phase noise due to the VCO
($S_{\theta o\_vco}$), we need the transfer functions ($H_{ref}$ or $H_{vco}$) from the particular source
of disturbance (PD or VCO) to the synthesizer output. Similarly, output phase noise
from the reference also depends on the transfer function $H_{ref}$. These transfer
functions depend on $F(s)$. To generalize, spur or noise from internal nodes affects
the synthesizer output according to the transfer functions from these internal nodes
to the output. Hence we conclude that the loop filter affects the performance of the
synthesizer through its impact on these transfer functions. In addition, $F(s)$ also
affects the stability of the synthesizer. Therefore the design of the loop filter is
guided by achieving an optimal compromise between these impacts. There are two
approaches: phase noise based and spur based.
8.3.1     Phase Noise Based Approach

In this approach we design $F(s)$ based primarily on phase noise and stability
requirements. The resulting synthesizer can then be checked to see whether it meets
the spur requirement. Following this philosophy, it turns out that phase noise has a
more direct (one to one) dependency on $G(s)$. Hence, we would first design $G(s)$
based on the phase noise requirement. Then from $G(s)$ we can in turn determine $F(s)$.

8.3.1.1     Phase  Noise  Requirement 

First  we  describe  qualitatively  the  impact  of  the  synthesizer's  phase  noise  on  the 
receiver  front  end's  performance.  These  impacts  are  then  quantified  into  specific 
requirements  on  the  synthesizer  [10],  [11]. 

We assume that the output of the synthesizer from Figure 8.1 consists of a
sinusoidal waveform with frequency $f_0$. When corrupted with phase noise, the
synthesizer output can be written as

$$v_o = \cos(\omega_0 t + \Delta\theta(t)) \tag{8.20}$$

Here $\Delta\theta(t)$ is the random phase fluctuation due to noise inside the synthesizer. If
we compare (8.20) to (7.114), we find that they have similar form. Of course, $\Delta\theta(t)$
in (8.20) refers to phase noise of the synthesizer whereas $\Delta\theta(t)$ in (7.114) refers to
phase noise of the oscillator alone. They would have different PSDs. However, the
interpretation of phase noise, as outlined in Section 7.8, applies in both cases. In
other words, the PSD of $\Delta\theta(t)$, denoted as $S_{\theta\_syn}(f)$, can be interpreted both in the
phase and amplitude domain.

Now the synthesizer's phase noise has two independent impacts on the receiver
front end's SNR and hence BER. The first is called phase impairment,
and is best explained by interpreting the $\Delta\theta(t)$ term in (8.20) in the phase domain.
To facilitate explanation, let us assume a heterodyne architecture in which
the frequency synthesizer output mixes with the received signal and downconverts it to an
IF signal. This IF signal is then demodulated coherently using a carrier recovery
PLL loop. (This carrier recovery PLL loop is separate from the frequency
synthesizer.) The carrier recovery PLL loop output is assumed to demodulate a phase
encoded signal, like the BPSK encoded signal described in Chapter 1. The carrier
recovery PLL loop output has phase impairment similar to that of a frequency synthesizer
and hence its output is also given by (8.20). Therefore the original equations (1.3)
and (1.4), rewritten as


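$$\text{logic 0:}\quad s_0(t) = \int_0^{T_b}(A\cos\omega_0 t)(A\cos\omega_0 t)\,dt = \frac{A^2T_b}{2} \tag{8.21}$$

$$\text{logic 1:}\quad s_1(t) = \int_0^{T_b}(-A\cos\omega_0 t)(A\cos\omega_0 t)\,dt = -\frac{A^2T_b}{2} \tag{8.22}$$

become

$$\text{logic 0:}\quad s_0(t) = \int_0^{T_b}(A\cos\omega_0 t)\bigl(A\cos(\omega_0 t+\Delta\theta(t))\bigr)\,dt \neq \frac{A^2T_b}{2} \tag{8.23}$$

$$\text{logic 1:}\quad s_1(t) = \int_0^{T_b}(-A\cos\omega_0 t)\bigl(A\cos(\omega_0 t+\Delta\theta(t))\bigr)\,dt \neq -\frac{A^2T_b}{2} \tag{8.24}$$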


We can see that because of $\Delta\theta(t)$ in (8.23), (8.24), upon demodulation the result
is no longer simply $\pm A^2T_b/2$. Again, referring to Section 1.3, $E_b$ is no longer
simply $A^2T_b/2$. This will degrade $P_e$, or BER. Let us return to our original
discussion and ask what impact the $\Delta\theta(t)$ in the synthesizer output, which is
given by (8.20), has on the demodulated signal. Remember that the synthesizer output is
mixed with the received signal to downconvert it before the IF signal is
demodulated using the carrier recovery PLL loop output. In the mixing
operation, however, the $\Delta\theta(t)$ in the synthesizer corrupts the phase of the
downconverted signal. Hence the first $\cos(\omega_0 t)$ terms in (8.21), (8.22) (which
represent the IF signal) become $\cos(\omega_0 t+\Delta\theta(t))$. (Contrast this with the previous
case, where the only $\Delta\theta(t)$ comes from the carrier recovery PLL loop and
hence the second $\cos(\omega_0 t)$ terms become $\cos(\omega_0 t+\Delta\theta(t))$.) From (8.21), (8.22) it
does not matter whether $\Delta\theta(t)$ is introduced in the first $\cos(\omega_0 t)$ terms or the second
$\cos(\omega_0 t)$ terms; (8.23), (8.24) still follow. Accordingly, in the demodulation process, the
$\Delta\theta(t)$ introduced by the synthesizer degrades the final $P_e$ in exactly the same way
that the $\Delta\theta(t)$ in the carrier recovery PLL loop does. Of course, if we have $\Delta\theta(t)$ in
both the synthesizer and the carrier recovery PLL loop, then both the first and second
$\cos(\omega_0 t)$ terms in (8.21), (8.22) will be corrupted.
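To make the size of the phase impairment concrete, the following worked equation evaluates the logic 0 correlation under two simplifying assumptions that are not stated in the text: the phase error is approximately constant over one bit period, $\Delta\theta(t) \approx \Delta\theta$, and the bit period spans many carrier cycles, so the double-frequency term integrates to approximately zero.

```latex
\begin{aligned}
s_0 &= \int_0^{T_b} (A\cos\omega_0 t)\bigl(A\cos(\omega_0 t + \Delta\theta)\bigr)\,dt \\
    &= \frac{A^2}{2}\int_0^{T_b} \bigl[\cos\Delta\theta + \cos(2\omega_0 t + \Delta\theta)\bigr]\,dt \\
    &\approx \frac{A^2 T_b}{2}\,\cos\Delta\theta
\end{aligned}
```

Under these assumptions the correlator output shrinks by the factor $\cos\Delta\theta$ relative to the ideal value $A^2T_b/2$. A phase error of 0.1 rad, for example, gives $\cos(0.1) \approx 0.995$, whereas an error of 1 rad gives $\cos(1) \approx 0.54$, which is why the integrated phase noise has to be kept small.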

The second impact, called reciprocal mix, comes about because of the presence
of adjacent channel interference. This is best understood by viewing the $S_{\theta\_syn}(f)$
associated with the $\Delta\theta(t)$ term in (8.20) in the amplitude domain. $S_{\theta\_syn}(f)$ for a
typical synthesizer has the shape shown in Figure 8.7.

Fig. 8.7 Power spectral density of synthesizer phase noise

Fig. 8.8 (a) Power spectral density of synthesizer phase noise, (b) Equivalent representation using frequency impulses

Hence we can view the phase noise as making the output tone impure. If strong
enough, it will mix the strong adjacent channel interference down to the desired IF.
The frequency domain explanation of how this happens is given in Figure 8.8 and
Figure 8.9. To demonstrate this mixing of the adjacent channel, Figure 8.8
takes Figure 8.7 and represents all the relevant phase noise energy on each side of $f_0$
by two frequency impulses. These impulses are situated at frequencies denoted as
$f_0 \pm f_{noise}$. We assume that this phase noise energy is situated one channel away
from $f_0$ and hence we have $f_{noise} = f_{channel}$. Next we go to Figure 8.9, where for
simplicity only the $f_0 + f_{noise}$ impulse is drawn. As shown in Figure 8.9, this
undesired sideband energy from the synthesizer (phase noise) "reciprocal mixes"
with an adjacent channel interference at $f_{desire} + f_{channel}$ (blocker) and manifests
itself as noise on top of the desired signal. The final effect is much the
same as that due to intermodulation between two adjacent channel interferences as
described in subsection 2.5.2; that is, a reduction of SNR and hence a larger BER.
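The frequency bookkeeping behind reciprocal mixing can be sketched with simple arithmetic. The example below is illustrative only: the LO and desired-signal frequencies are hypothetical round numbers, the channel offset is set equal to the 1.728 MHz DECT figure as an assumption, and the mixer is idealized as producing the difference frequency. It shows that the blocker mixed with the phase noise sideband lands on the same IF as the desired signal mixed with the carrier.

```python
def downconvert(f_rf_hz, f_lo_hz):
    """Difference-frequency product of an idealized mixer, in Hz."""
    return abs(f_rf_hz - f_lo_hz)

# Hypothetical frequencies in Hz (integers keep the arithmetic exact).
f_lo = 1_780_000_000          # synthesizer carrier, f_0
f_desired = 1_890_000_000     # desired channel
f_channel = 1_728_000         # assumed channel offset

f_if = downconvert(f_desired, f_lo)       # desired signal x carrier

f_blocker = f_desired + f_channel         # adjacent channel interferer
f_sideband = f_lo + f_channel             # phase noise impulse at f_0 + f_noise
f_reciprocal = downconvert(f_blocker, f_sideband)

print(f_if, f_reciprocal)
print(f_if == f_reciprocal)
```

Because the two products coincide, no IF filter can separate them afterward; the only remedy is to lower the phase noise at the offset where the blocker sits.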

Notice  that  the  phase  impairment  effect  is  always  there,  with  and  without 
interference,  whereas  the  reciprocal  mix  effect  is  there  only  when  there  is  adjacent 
channel  interference.  Which  impact  is  more  dominant  depends,  among  other 
factors,  on  the  applications. 

We  assume  that  the  frequency  synthesizer  is  used  to  generate  the  LO  signal  for 
the  first  mixer  in  the  heterodyne  architecture  of  Chapter  2. 

Criterion 1: For phase impairment the final degradation in BER depends, in a
complicated fashion, on the modulation scheme and the statistics of $\Delta\theta(t)$. The
relationship between the integrated phase noise and the resulting BER is best obtained
via simulation [10]. This then gives us a requirement on the tolerable integrated
phase noise. As an example, in [10], for QPSK modulation, an SNR of 9 dB, and a BER of
$10^{-3}$ (these are close to the requirements for DECT), the synthesizer integrated
phase noise should be less than -15 dBc. We build in some safety margin and
specify a requirement of -20 dBc for DECT.
Fig. 8.9 Explanation of reciprocal mixing phenomenon

Criterion 2: Next we quantify the reciprocal mix effect [11]. Here the phase noise
must be minimized such that under the worst-case blocking scenario, the power of
the reciprocal mixed interferer is below the desired signal level by the minimum
SNR the demodulator requires, denoted in Chapter 2 as $SNR_{demod\_in}$.

In  determining  the  phase  noise  figure,  the  following  simplifying  assumptions 
were  made: 

1. The receiver channel and front end are assumed to be noise free.

2.  Phase  noise  is  assumed  to  be  flat  across  the  band  of  interest  at  a  certain  offset 
from  the  carrier. 

3.  The  only  interference  produced  within  the  signal  band  moving  through  the 
receiver  front  end  is  due  to  the  phase  noise  reciprocal  mixing  with  out-of-signal 
band  blockers. 

In general, the calculation of allowable phase noise of a frequency synthesizer
for the first-stage mixer can be summarized by the following formula [11]:

$$10\log S_{\theta\_allowable\_syn}(f_i)\ (\mathrm{dBc/Hz}) = -10\log S_{interference}(f_i)\ (\mathrm{dBm/dBV}) + 10\log S_{desired}(f_i)\ (\mathrm{dBm/dBV}) - SNR_{demod\_in}\ (\mathrm{dB}) - 10\log(BW) \tag{8.25}$$

Here $10\log S_{\theta\_allowable\_syn}(f_i)$ is the PSD of allowable phase noise $f_i$ away from the
carrier in dBc/Hz; $10\log S_{interference}$ is the power of the interference in dBm or dBV
at the mixer input; $10\log S_{desired}$ is the power of the desired signal in dBm or dBV at
the mixer input; $SNR_{demod\_in}$ is the required signal-to-noise ratio at the demodulator
input; $BW$ is the bandwidth of the desired signal.

We illustrate the calculation by using the DECT standard and heterodyne
architecture given in Chapter 2. Considering the DECT standard, the maximum
$S_{interference}$ can be at -62 dBm (worst case) compared with a minimum $S_{desired}$ at
-77 dBm (worst case). First we note that if BPF1 and the LNA provide the same gain to
the interference and the desired signal, then their difference in power level remains the
same when we go from antenna to mixer. Hence $-10\log S_{interference} + 10\log S_{desired}
= -(-62\ \mathrm{dBm}) + (-77\ \mathrm{dBm})$. Second, we conclude from assumptions 1 and 3 and
Chapter 2 that $SNR_{demod\_in} = 25$ dB. The BW of DECT is 1.728 MHz. Substituting
all these in (8.25), we have:

$$10\log S_{\theta\_allowable\_syn}(f_{channel})\ (\mathrm{dBc/Hz}) = -(-62\ \mathrm{dBm}) + (-77\ \mathrm{dBm}) - 25\ \mathrm{dB} - 10\log(1.728\ \mathrm{MHz}) = -102\ \mathrm{dBc/Hz} \tag{8.26}$$
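The arithmetic in (8.26) can be reproduced with a few lines of code. This is a minimal sketch of (8.25) as a function; the function and parameter names are illustrative and are not taken from the text. The bandwidth is entered in Hz so that the result is per Hz.

```python
import math

def allowable_phase_noise_dbc_hz(interference_dbm, desired_dbm,
                                 snr_demod_in_db, bandwidth_hz):
    """Allowable synthesizer phase noise at the blocker offset, per (8.25)."""
    return (-interference_dbm + desired_dbm
            - snr_demod_in_db - 10 * math.log10(bandwidth_hz))

# DECT numbers quoted in the text.
dect_limit = allowable_phase_noise_dbc_hz(
    interference_dbm=-62.0,   # worst-case blocker at the mixer input
    desired_dbm=-77.0,        # worst-case desired signal at the mixer input
    snr_demod_in_db=25.0,     # required SNR at the demodulator input
    bandwidth_hz=1.728e6,     # DECT signal bandwidth
)
print(round(dect_limit, 1))   # about -102.4, which the text rounds to -102 dBc/Hz
```

Changing the standard amounts to changing the four arguments, which makes it easy to see how a stronger blocker or a wider channel tightens or relaxes the requirement.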

Let us go back and use these requirements to guide the design of $G(s)$. We first
repeat (8.6) here:

$$G(s) = \frac{F(s)K_{pd}K_{vco}}{sM}$$

Fig. 8.10 Power spectral density of phase noise sources

How do we determine $G(s)$? The first thing we want to find out about $G(s)$ is its
unity gain frequency, $\omega_u$. We note that $\omega_u$ of $G(s)$ is directly related to the
synthesizer phase noise requirement. Hence we need to find the dependency of $S_{\theta\_syn}$
on $\omega_u$ of $G(s)$.


8.3.1.2     Power Spectral Density of Synthesizer Phase Noise ($S_{\theta\_syn}$)

To find the dependency of $S_{\theta\_syn}$ on $\omega_u$, we need to obtain the following information [8]:

1. PSD of phase noise sources

2. Transfer function from phase noise sources to synthesizer output

To help clarify notation we define $\omega_i$ ($f_i$) as the frequency offset from the oscillation
frequency $\omega_0$ ($f_0$) in rad/s (Hz). This is to be distinguished from $\omega$ (or $f$), the absolute
frequency under consideration. Of course, $\omega_i = \omega - \omega_0$ ($f_i = f - f_0$).


PSD of phase noise sources


1. Figure 8.10 shows $S_{\theta\_vco}(\omega)$ from the VCO. In the ring oscillator case we
discussed that its phase noise has a $1/(\omega-\omega_0)$ characteristic, and we can generalize
that by saying that an oscillator's phase noise PSD in general has a bandpass
characteristic around $\omega_0$, the oscillation frequency [8], as shown in Figure 8.10.

2. There are other sources of phase noise (other than from oscillators). As a matter
of fact, any noise source in the synthesizer (divider, phase detector, charge
pump, reference, noise coupling from substrate and supply) can result in phase
noise. For the time being, we assume that aside from the VCO noise source,
reference noise is the other major contributor to phase noise [9]. We further
know that the PSD of reference phase noise, denoted as $S_{\theta\_ref}(\omega)$, is relatively white
around $\omega_0$ [8], as shown in Figure 8.10 (in practice it will start to rise as the
frequency becomes close enough to $\omega_0$ [9]). This flat PSD is then taken to
have a value equal to $K$ dBc/rad. When referred to the output, this PSD is
multiplied by $M^2$, the square of the division ratio.

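As shown in Figure 8.10, $S_{\theta\_vco}(\omega)$ falls off from infinity as we move away from $\omega_0$.
Since $M^2S_{\theta\_ref}(\omega)$ is flat, eventually it will become larger than $S_{\theta\_vco}(\omega)$. The
distance between $\omega_0$ and the point where these two PSDs intersect is defined as $\omega_c$, the
crossover frequency. Mathematically it is defined as:

$$S_{\theta\_vco}(\omega)\big|_{\omega=\omega_0+\omega_c} = M^2S_{\theta\_ref}(\omega)\big|_{\omega=\omega_0+\omega_c} = KM^2$$

In terms of offset frequency we have:

$$S_{\theta\_vco}(\omega_i)\big|_{\omega_i=\omega_c} = M^2S_{\theta\_ref}(\omega_i)\big|_{\omega_i=\omega_c} = KM^2 \tag{8.28}$$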


Transfer  Functions  for  Phase  Noise  Sources 

Figure 8.11 shows the diagram redrawn from Figure 8.1, but with phase noise from the
reference source, $\theta_{i\_ref}^2$, and the VCO, $\theta_{i\_vco}^2$, explicitly shown. Let us also redraw
Figure 8.10 in Figure 8.12, using offset frequency $\omega_i$. In (7.119), we denoted the
transfer function from the VCO node to the synthesizer output as $H_{vco}$. Similarly, in
Chapter 7 we also denoted the transfer function from the reference source node to
the synthesizer output as $H_{ref}$. We have already used this $H_{ref}$ to relate the following
disturbances injected at the reference source node to the synthesizer output: spur
from the PD [see (7.15)] and phase noise from the PD [see (7.18)]. We now use $H_{ref}$
to relate the phase noise from the reference source node to the synthesizer output.
For the frequency synthesizer in Figure 8.11 let us denote the forward gain $A(s)$
and the feedback factor $B$. Hence

$$G(s) = A(s)B \tag{8.29}$$

and

$$B = 1/M \tag{8.30}$$

From classical feedback theory we know that if the synthesizer is modeled as an
LTI system, then

$$H_{vco} = \frac{1}{1+A(s)B} = \frac{1}{1+G(s)} \tag{8.31}$$

and

$$H_{ref} = \frac{A(s)}{1+A(s)B} = \frac{MG(s)}{1+G(s)} \tag{8.32}$$

Fig. 8.11 Block diagram of synthesizer with phase noise sources

Fig. 8.12 Power spectral density of phase noise sources referenced using offset frequency ($\omega_i$)

Now what would these transfer functions look like? From (8.18) and (8.19), $F(s)$
and $G(s)$ are low-pass functions. If we substitute (8.18) and (8.19) into (8.31) and
(8.32), we find that $H_{vco}$ is a high-pass function and $H_{ref}$ is a low-pass function [8].
We should emphasize that terms like low-pass and high-pass are used with the
understanding that the underlying frequency variable is $\omega_i$ (not $\omega$). First let us apply
the approximation: when $\omega_i < \omega_u$, $|G(s)| \gg 1$; when $\omega_i > \omega_u$, $|G(s)| \ll 1$ [basically
replace $G(s)$ by 0]. Then we can approximate $H_{vco}$ as shown in Figure 8.13. Let us
apply the same $G(s)$ approximation to $H_{ref}$. Then when $\omega_i < \omega_u$, $H_{ref} = M$, and when
$\omega_i > \omega_u$,

$$H_{ref} \approx A(s) \tag{8.33}$$

Fig. 8.13 Frequency response of VCO transfer function

Fig. 8.14 Frequency response of reference transfer function

We now make one more approximation: when $\omega_i > \omega_u$, $|A(s)|$ becomes 0.
Substituting this approximation into (8.33), we have

$$H_{ref} = 0 \quad \text{for } \omega_i > \omega_u \tag{8.34}$$

This is called the first-order approximation to $H_{ref}$ at $\omega_i > \omega_u$. Both $H_{vco}$ and $H_{ref}$ are
shown in Figure 8.13 and Figure 8.14, where the exact responses are shown in
dotted lines and the idealized responses are shown in solid lines.

It has been shown in [8] that for a PLL with an arbitrary loop filter $F(s)$, the unity
gain frequency of the resulting open-loop transfer function $G(s)$ becomes the -3 dB
bandwidth of the closed-loop transfer function. Hence we can conclude that $\omega_u$ of
$G(s)$ is also the -3 dB bandwidth for $H_{ref}$. By the same argument, we can conclude
that $\omega_u$ is also the -3 dB bandwidth for $H_{vco}$, except that this time $H_{vco}$ is a high-pass
function. In summary, $\omega_u$ of $G(s)$ is the -3 dB bandwidth for both $H_{vco}$ and $H_{ref}$.
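The high-pass and low-pass behavior described above can be observed numerically. The sketch below is illustrative only: it uses the large-$b$ open-loop gain of (8.16) with hypothetical component values, finds the unity gain frequency by bisection (which assumes $|G|$ decreases monotonically with frequency), and then evaluates $H_{vco}$ and $H_{ref}$ from (8.31) and (8.32) two decades below and above that frequency.

```python
import math

# Hypothetical loop parameters, for illustration only.
K_PD, K_VCO, M = 1e-4, 2e8, 100
R2, C = 1e3, 10e-9

def G(omega_i):
    """Open-loop gain at s = j*omega_i using the approximate form (8.16)."""
    s = 1j * omega_i
    z1 = 1 / (R2 * C)
    return K_PD * K_VCO / (s * M) * R2 * (s + z1) / s

def unity_gain_frequency(gain, lo=1.0, hi=1e12, steps=200):
    """Log-scale bisection for |gain| = 1, assuming |gain| falls as omega rises."""
    for _ in range(steps):
        mid = math.sqrt(lo * hi)
        if abs(gain(mid)) > 1:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)

def H_vco(omega_i):
    """Transfer function from the VCO noise node to the output, per (8.31)."""
    return 1 / (1 + G(omega_i))

def H_ref(omega_i):
    """Transfer function from the reference node to the output, per (8.32)."""
    return M * G(omega_i) / (1 + G(omega_i))

omega_u = unity_gain_frequency(G)
low, high = omega_u / 100, omega_u * 100

print(omega_u)
print(abs(H_vco(low)), abs(H_vco(high)))   # small below omega_u, close to 1 above
print(abs(H_ref(low)), abs(H_ref(high)))   # close to M below omega_u, small above
```

Near $\omega_u$ itself the exact responses depart from the idealized ones, which corresponds to the difference between the dotted and solid curves mentioned for Figures 8.13 and 8.14.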
Optimal $S_{\theta\_syn}$

To find $S_{\theta\_syn}$ in terms of $\omega_u$, we follow Figure 4.36 and Figure 4.37 of [8]. The
procedure consists of combining Figure 8.12 (PSD of input phase noise), Figure
8.13 and Figure 8.14 (transfer functions) to derive $S_{\theta\_syn}$.

What is the best way of designing $\omega_u$ to achieve the optimal $S_{\theta\_syn}$? It seems that
an optimum $S_{\theta\_syn}$ can be achieved if the following is satisfied [9]:

$$\omega_u = \omega_c \tag{8.35}$$

Why? From Figure 8.12, for $\omega_i < \omega_c$, $S_{\theta\_vco}$ dominates, and hence we want to
suppress it. From Figure 8.13, for $\omega_i < \omega_u$, $H_{vco}$ is zero. Hence by making $\omega_c = \omega_u$, we
guarantee that at frequencies where $S_{\theta\_vco}$ dominates, it is suppressed at the output.
Conversely, from Figure 8.12, for $\omega_i > \omega_c$, $M^2S_{\theta\_ref}$ dominates, and hence we want to
suppress it. From Figure 8.14, for $\omega_i > \omega_u$, $H_{ref}$ is zero. Hence again by making
$\omega_c = \omega_u$, we guarantee that at frequencies where $S_{\theta\_ref}$ dominates, it is suppressed at
the output. In other words [9], the loop filter will attenuate the VCO phase noise
more at frequencies below $\omega_c$, where it is greater than the reference phase
noise. Conversely, it will attenuate the reference phase noise more at frequencies
above $\omega_c$, where it is greater than the VCO noise.

We can restate the foregoing as follows:

1. When $\omega_i < \omega_u\,(=\omega_c)$, $S_{o\theta\_vco}$ (PSD of output phase noise due to VCO) is heavily
suppressed. Hence $S_{\theta\_syn}$ is dominated by $S_{o\theta\_ref}$ (PSD of output phase noise due
to reference). However, $S_{o\theta\_ref}$ follows $S_{\theta\_ref}$. Hence $S_{\theta\_syn}$ follows $S_{\theta\_ref}$.

2. When $\omega_i > \omega_u\,(=\omega_c)$, exactly the opposite occurs. $S_{o\theta\_ref}$ is heavily suppressed.
Hence $S_{\theta\_syn}$ is dominated by $S_{o\theta\_vco}$. However, $S_{o\theta\_vco}$ follows $S_{\theta\_vco}$. Hence
$S_{\theta\_syn}$ follows $S_{\theta\_vco}$.

Combining the two results, we get $S_{\theta\_syn}$, which is shown in Figure 8.15. Notice that
since $\omega_u = \omega_c$, at $\omega_u$, by definition $M^2S_{\theta\_ref} = S_{\theta\_vco}$. This is also shown in Figure
8.15. Hence the dependency of the optimal $S_{\theta\_syn}$ on $G(s)$ is established.
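The two-region description above can be written as a small piecewise model. The sketch below is an idealization under stated assumptions: the VCO phase noise law passed in is a hypothetical $c/\omega_i^2$ shape chosen only so that there is a curve to intersect, the constants are made-up numbers in linear (not dB) units, and the loop is assumed to have $\omega_u = \omega_c$ with ideal suppression on either side.

```python
import math

def crossover_frequency(s_vco, K, M, lo=1.0, hi=1e12, steps=200):
    """Offset omega_c where s_vco(omega_c) = K*M**2, found by log-scale bisection.

    Assumes s_vco decreases monotonically with offset frequency."""
    target = K * M**2
    for _ in range(steps):
        mid = math.sqrt(lo * hi)
        if s_vco(mid) > target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)

def synthesizer_psd(omega_i, s_vco, K, M, omega_c):
    """Idealized synthesizer PSD with omega_u = omega_c.

    Below omega_c the output follows the reference (K*M**2);
    above omega_c it follows the VCO."""
    if omega_i < omega_c:
        return K * M**2
    return s_vco(omega_i)

# Hypothetical numbers, linear units, for illustration only.
K, M = 2.5e-12, 100

def s_vco(omega_i):
    """Assumed VCO phase noise law (not taken from the text)."""
    return 1e3 / omega_i**2

omega_c = crossover_frequency(s_vco, K, M)
print(omega_c)
print(synthesizer_psd(1e4, s_vco, K, M, omega_c))   # reference-limited region
print(synthesizer_psd(1e6, s_vco, K, M, omega_c))   # VCO-limited region
```

The model is continuous at $\omega_c$ by construction, since that is the point where the two PSDs are equal.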

So far we have established the general criterion for selecting $\omega_u$. This does not
allow us to determine its specific value. For example, according to (8.28), $\omega_c$ can be
set arbitrarily by varying $K$, and hence the specific value of $\omega_u$ can also be set arbitrarily.
To determine the specific value of $\omega_u$ we need the phase noise requirements
explained in criteria 1 and 2.


8.3.1.3     Crossover  Frequency  (wc),  Unity  Gain  Frequency  (to„) 

From criterion 1, subsection 8.3.1.1, we are interested in the integrated phase noise. Referring to Figure 8.15, integrated phase noise can be interpreted as the area underneath the $S_{\theta,\mathrm{syn}}$ curve. Specifically, if we neglect close-in phase noise (i.e., that part of the PSD that is very close to $\omega_i = 0$) and assume that noise beyond $\omega_c\,(=\omega_u)$ is negligible, then

$$\text{integrated phase noise} \approx 2KM^2\omega_c \tag{8.36}$$

Fig. 8.15    Power spectral density of synthesizer phase noise referenced using offset frequency ($\omega_i$)


A tolerable value on this integrated phase noise, as explained under criterion 1, sub-section 8.3.1.1, is best obtained from simulation. Substituting this value in (8.36) we have one equation and two unknowns: K and $\omega_c$. Combining this equation with (8.28), we have two equations and two unknowns and we can solve for K and $\omega_c$. Applying this value of $\omega_c$ to (8.35), $\omega_u$ is determined.

Next we want to check whether the $\omega_u$ we have obtained will result in an $S_{\theta,\mathrm{syn}}$ that satisfies criterion 2. If it does, then the $\omega_u$ we have selected has led to a design with acceptable phase noise performance. Otherwise we have to modify $\omega_u$. To carry out this check, let us start with (8.25), which gives a value on the allowable $S_{\theta,\mathrm{syn}}$, $S_{\theta,\mathrm{allowable\_syn}}(\omega_i)$. Criterion 2 is satisfied if

$$S_{\theta,\mathrm{syn}}(\omega_i = \omega_{\mathrm{channel}}) \le S_{\theta,\mathrm{allowable\_syn}}(\omega_i = \omega_{\mathrm{channel}}) \tag{8.37}$$


To obtain the proper value of $S_{\theta,\mathrm{syn}}$ let us consider Figure 8.15. We note that, depending on the $\omega_u$ of G(s), either

$$\omega_{\mathrm{channel}} < \omega_u \tag{8.38}$$

or

$$\omega_{\mathrm{channel}} > \omega_u \tag{8.39}$$


Case 1: reference phase noise dominates

If $\omega_u$ is so designed that (8.38) is true, then reference phase noise dominates. Then we can apply (8.38) to Figure 8.15 and we have

$$S_{\theta,\mathrm{syn}}(\omega_{\mathrm{channel}}) = M^2 S_{\theta,\mathrm{ref}}(\omega_{\mathrm{channel}}) = KM^2 \tag{8.40}$$

We then check whether the $S_{\theta,\mathrm{syn}}(\omega_{\mathrm{channel}})$ obtained in (8.40) satisfies (8.37).

If (8.37) is satisfied, then $\omega_u$ is acceptable. If not, we can decrease K until (8.37) is satisfied. Of course, $\omega_c$ would increase as a result and, according to (8.35), $\omega_u$ would have to be increased. (8.36) would then have to be checked. The process may have to be carried out iteratively until both (8.36) and (8.37) are satisfied.

Case  2:  VCO  phase  noise  dominates 

If, on the other hand, $\omega_u$ is so designed that (8.39) is true, then VCO phase noise dominates. Then we can apply (8.39) to Figure 8.15 and we have

$$S_{\theta,\mathrm{syn}}(\omega_{\mathrm{channel}}) = S_{\theta o,\mathrm{vco}}(\omega_{\mathrm{channel}}) = S_{\theta,\mathrm{vco}}(\omega_{\mathrm{channel}}) \tag{8.41}$$

We then check whether the $S_{\theta,\mathrm{syn}}(\omega_{\mathrm{channel}})$ obtained in (8.41) satisfies (8.37). If this is true, then $\omega_u$ is acceptable. If not, two alternatives can be pursued.

In the first alternative we can redesign $\omega_u$ such that reference phase noise becomes dominant again. We can do this by increasing $\omega_u$ of G(s) until (8.38) is satisfied. Then we can follow the same procedure as described in "Case 1: reference phase noise dominates" to ensure that (8.37) is satisfied. In the second alternative we can redesign the VCO and $\omega_u$ together. We first redesign the VCO such that $S_{\theta,\mathrm{vco}}(\omega_{\mathrm{channel}})$ is lowered until (8.37) is satisfied. Then this new $S_{\theta,\mathrm{vco}}$ is used in (8.28) to calculate the new $\omega_c$. Of course, the new $\omega_c$ is smaller, and according to (8.35), $\omega_u$ would have to be decreased.

In summary, in both cases we modify $\omega_u$ of G(s), if necessary, until (8.37) and (8.36) are satisfied.


8.3.1.4     Other  Poles,  and  Zeroes 

$\omega_u$ is designed in the above section for F(s) of any order. The design of the other poles and zeroes depends on the exact order of F(s).

We omit the design of the other poles and zeroes of $F_1(s)$ because it is relatively simple. For $F_2(s)$ as shown in (8.14) we have already made

$$p_2 = 0 \tag{8.42}$$

This will be shown to help reduce spur. Next we note that $\omega_u$ of $G_2(s)$ [given in (8.15)] should be smaller than $p_3$. This ensures that when $\omega = p_3$, which means $\angle G_2(j\omega) = -135$ degrees, $|G_2(j\omega)|$ is less than 1. This guarantees that the synthesizer is stable. Furthermore, $\omega_u$ should be greater than $z_1$ because $G_2(j\omega)$ starts off with two poles, $p_1$ and $p_2$, at the origin, giving a 180-degree phase shift. Hence if $\omega_u$ is smaller than $z_1$, then at $\omega_u$, $G_2(j\omega)$ has close to 180 degrees phase shift and the phase margin of the synthesizer is very poor [6]. This will result in a sharp peak in the closed-loop response [6]. Hence, to avoid this, $\omega_u$ is set to be greater than $z_1$. Accordingly, $\omega_u$ is between $z_1$ and $p_3$, as depicted in Figure 8.6b.

To eliminate the peaking, we need to set $z_1$ precisely. How? We note that since $p_3$ is far away, the PLL behaves like a second-order LTI system, with a damping factor denoted as $\xi$. To eliminate peaking we want to set $\xi = 1$. This means that

$$z_1 = \frac{1}{4}\,\omega_u \tag{8.43}$$

We now want to find $p_3$ by making use of the stability requirement. We note that

$$\text{phase margin} = 360^\circ - 180^\circ + \angle G_2(j\omega_u) = 360^\circ - 180^\circ - 180^\circ + \tan^{-1}\frac{\omega_u}{z_1} - \tan^{-1}\frac{\omega_u}{p_3} \tag{8.44}$$

Substituting (8.43) into (8.44),

$$\text{phase margin} = \tan^{-1} 4 - \tan^{-1}\frac{\omega_u}{p_3} \tag{8.45}$$

From the specifications, phase margin is usually given and this determines $p_3$.
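As a quick illustration of how (8.43) and (8.45) fix the zero and the third pole, the following sketch solves (8.45) for $p_3$ given a phase margin. It is only a sketch: the function name is hypothetical, $\omega_u$ is normalized to 1, and the 55-degree phase margin is borrowed from the DECT example of Table 8.1.

```python
import math

def third_pole_from_phase_margin(omega_u, phase_margin_deg):
    """Solve (8.45), PM = atan(4) - atan(omega_u / p3), for p3.

    Hypothetical helper; assumes z1 = omega_u / 4 as in (8.43).
    """
    residual = math.atan(4.0) - math.radians(phase_margin_deg)
    if residual <= 0.0:
        # atan(4) is about 76 degrees, the largest margin this zero placement allows
        raise ValueError("phase margin must be below atan(4)")
    return omega_u / math.tan(residual)

omega_u = 1.0                      # normalized unity gain frequency (assumption)
z1 = omega_u / 4.0                 # (8.43)
p3 = third_pole_from_phase_margin(omega_u, 55.0)

# Re-evaluate (8.45) with the solved p3 as a consistency check.
pm_check = math.degrees(math.atan(omega_u / z1) - math.atan(omega_u / p3))
print(f"z1 = {z1:.3f} * omega_u, p3 = {p3:.3f} * omega_u, PM = {pm_check:.1f} deg")
```

With these assumptions $p_3$ comes out at roughly 2.6 times $\omega_u$, so $\omega_u$ indeed lies between $z_1$ and $p_3$ as required above.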

For a higher-order filter with a transfer function F(s) [as described in (8.18)], we first assume, for the same reasons as in the $F_2(s)$ case, that (8.42) holds. We further assume that the higher-order poles and zeroes ($p_4 \ldots p_n$, $z_2 \ldots z_m$) are so far away that (8.43) and (8.44) still hold. We can therefore determine $z_1$, $p_3$ of F(s) using the same procedure as in the $F_2(s)$ case. The higher-order poles and zeroes are just for fine-tuning of the synthesizer's performance, and their assignments (for typical applications) can be rather loose as long as they are far enough from $p_3$.

We have now come up with an initial design of F(s) based on phase noise and stability. We will have a chance to do a complete design using this approach in section 8.4, where we will also check whether the spur requirement is satisfied.


8.3.2     Spur-Based  Approach 

In this approach we design F(s) based primarily on spur and stability requirements. The resulting synthesizer can then be checked to see whether it meets the phase noise requirement. We again denote $\omega_u$ as the unity gain frequency of the resulting G(s). First we omit the design of $F_1(s)$ because it is rather simple. For second and higher-order filters, we follow sub-section 8.3.1.4 and determine $p_2$ according to (8.42). We then relate $z_1$, $p_3$ to $\omega_u$ via (8.43) and (8.45).

Next, to determine $\omega_u$, we bring in the spur consideration. To save space, we skip the discussion of spur requirements. We assume that spur requirements have been derived in a procedure similar to the derivation of phase noise requirements due to the reciprocal mixing effect in sub-section 8.3.1.1. Remember that [from (7.17a)] spur depends on $\theta_o$, which according to (7.15) depends on $H_{ref}$. $H_{ref}$ in turn depends on F(s). Hence our goal is to find the dependency of spur on F(s).

Fig. 8.16    Block diagram of synthesizer with spur sources

First, Figure 8.16 is redrawn from Figure 7.12, with divider M in the feedback path. Following (8.29) and (8.30), we call the forward gain A(s), where


$$A(s) = K_{pd}\,F(s)\,\frac{K_{vco}}{s} \tag{8.46}$$

and the feedback factor B, where

$$B = \frac{1}{M} \tag{8.47}$$

Then from feedback theory we can immediately write

$$H_{ref} = \frac{A(s)}{1 + A(s)B} \tag{8.48}$$


If we follow the first-order approximation to $H_{ref}$ at $\omega_i > \omega_u$ as given in (8.34), this creates a problem because the spur frequency, $\omega_{spur}$, typically is larger than $\omega_u$. Hence this approximation will make $H_{ref} = 0$. Substituting this in (7.15) predicts a $\theta_o$ of 0, or zero output spur, which is clearly an underestimation and will not help us in the design process. We can overcome this problem by going back to (8.33), which is repeated here:

$$H_{ref} \approx A(s), \quad \omega_i > \omega_u \tag{8.49}$$


We will find another approximation to A(s). Referring to (8.46) we can do that by finding another approximation to F(s), as long as it does not lead to (8.34). To do this we start from the loop filter function F(s) in (8.18) and approximate it by the following form [12]:

$$F(s) \approx \frac{(s + z_1)}{(s + p_2)\cdots(s + p_n)} \tag{8.50}$$

This will make

$$H_{ref} = \frac{K_{pd}K_{vco}}{s}\left(\frac{(s + z_1)}{(s + p_2)\cdots(s + p_n)}\right) \quad \text{for } \omega_i > \omega_u \tag{8.51}$$

This is called the second-order approximation to $H_{ref}$ at $\omega_i > \omega_u$.

Turning back to F(s), we note that the approximation consists of assuming that F(s) has only one zero and that F($\infty$) is set to 1. Then we substitute (8.50) into (8.46) to get A(s). Following that, we substitute the resulting A(s) in (8.48) and we get the complete $H_{ref}$ as:

$$H_{ref} = \frac{K_{pd}F(s)\dfrac{K_{vco}}{s}}{1 + K_{pd}F(s)\dfrac{K_{vco}}{sM}} \tag{8.52a}$$

$$= \frac{K_{pd}\dfrac{(s + z_1)}{s(s + p_2)\cdots(s + p_n)}K_{vco}}{1 + K_{pd}\dfrac{1}{sM}\dfrac{(s + z_1)}{(s + p_2)\cdots(s + p_n)}K_{vco}} \tag{8.52b}$$


We make three comments about this transfer function:

1. $z_1$ typically is close to the origin and so the pole due to the VCO is canceled by this zero.

2. Even though F(s) is approximated differently, $H_{ref}$ still has its -3 dB frequency at $\omega_u$. This means that $H_{ref}$ has a factor $\frac{1}{s + \omega_u}$.

3. When we put this simplified F(s) inside the synthesizer loop, we assume that since $p_3, \ldots, p_n$ of F(s) are so far away from $\omega_u$, these poles do not move much.

From comments 1 through 3, we can further simplify (8.52b) as

$$H_{ref} = \frac{K}{(s + \omega_u)(s + p_3)\cdots(s + p_n)} \tag{8.53}$$

Only K now remains an unknown. At low frequency, that is, when $\omega_i \ll \omega_u$, we still have $|G(s)| = |A(s)B| \gg 1$. Applying this approximation to (8.48), it becomes

$$H_{ref}\big|_{\omega_i \ll \omega_u} = \frac{A(s)}{1 + A(s)B}\bigg|_{\omega_i \ll \omega_u} \approx \frac{1}{B} \tag{8.54}$$

Substituting (8.47) into (8.54),

$$H_{ref}\big|_{\omega_i \ll \omega_u} = M \tag{8.55}$$

Again at low frequency, $\omega_i \ll \omega_u$, (8.53) becomes

$$H_{ref}\big|_{\omega_i \ll \omega_u} = \frac{K}{\omega_u p_3 \cdots p_n} \tag{8.56}$$

(This is because at low frequency $|s| \ll \omega_u, p_3, \ldots, p_n$.) Equating (8.55) and (8.56),

$$M = \frac{K}{\omega_u p_3 \cdots p_n}$$

or

$$K = \omega_u p_3 \cdots p_n M \tag{8.57}$$

Substituting (8.57) back into (8.53), we finally determine the complete $H_{ref}$ expression:

$$H_{ref} = \frac{\omega_u p_3 \cdots p_n M}{(s + \omega_u)(s + p_3)\cdots(s + p_n)} \tag{8.58}$$

Notice that since comment 3 assumes that the higher-order poles do not move upon closing the loop, (8.58) does not account for peaking of the closed-loop response near $\omega_u$. Remember that from (7.17b), spur is related to $\theta_o^2$ as

$$\frac{\text{spur\_power}}{\text{carrier\_power}} = \theta_o^2 \tag{8.59}$$

To calculate $\theta_o$ we substitute (8.58) into (7.15), and we get:

$$\theta_o = \frac{\omega_u p_3 \cdots p_n M}{(s + \omega_u)(s + p_3)\cdots(s + p_n)}\,\frac{1}{K_{pd}}\,i_{spur} \tag{8.60}$$

If we replace $\theta_o$, $i_{spur}$ by their amplitudes and express everything in decibels we have

$$\text{spur}|_{dBc} = 10\log\frac{\text{spur\_power}}{\text{carrier\_power}} = 10\log\theta_o^2 = 20\log\theta_o = 20\log\left|\frac{\omega_u p_3 \cdots p_n M}{(s + \omega_u)(s + p_3)\cdots(s + p_n)}\,\frac{1}{K_{pd}}\,i_{spur}(\text{amplitude})\right| \tag{8.61}$$
Here the amplitude of $\theta_o$ is denoted by the notation spur, which actually represents the spur ratio. We use spur to denote both the spur component and the spur ratio. Its actual meaning should be clear from the context. Obviously, spur depends on the spur frequency, $\omega_{spur}$. We want the loop filter to filter out the spur at $\omega_{spur}$. Therefore, $\omega_{spur} \gg \omega_u$ (usually by a factor of at least 10). Typically, $|j\omega_{spur}| \gg p_3, \ldots, p_n$ as well, and therefore in the denominator of (8.61) we have

$$s + \omega_u \approx s + p_3 \approx s + p_4 \approx \cdots \approx s + p_n \approx j\omega_{spur} \tag{8.62}$$

We now substitute (8.62) into (8.61). Then (8.61) becomes

$$\text{spur}|_{dBc} = 20\log\left[\frac{\omega_u p_3 \cdots p_n M}{\omega_{spur}^{\,n-1}}\,\frac{1}{K_{pd}}\,i_{spur}(\text{amplitude})\right] \tag{8.63a}$$

$$= 20\log\left[\frac{\omega_u p_3 \cdots p_n M}{\omega_{spur}^{\,n-1}}\,\frac{2\pi}{I_{charge\_pump}}\,i_{spur}(\text{amplitude})\right] \tag{8.63b}$$

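Note that going from (8.63a) to (8.63b) we have used the definition $K_{pd} = I_{charge\_pump}/2\pi$, obtained from (7.9). What is $i_{spur}$(amplitude)? From (7.7),

$$\text{amplitude of } i_{spur} = d \times I_{charge\_pump} \tag{8.64}$$

Substituting this in (8.63b), we have

$$\text{spur}|_{dBc} = 20\log\left[\frac{\omega_u p_3 \cdots p_n M}{\omega_{spur}^{\,n-1}}\,\frac{2\pi}{I_{charge\_pump}}\,d \times I_{charge\_pump}\right] \tag{8.65a}$$

$$= 20\log\left[\frac{\omega_u p_3 \cdots p_n M}{\omega_{spur}^{\,n-1}}\,2\pi \times d\right] \tag{8.65b}$$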


Now that we have established the dependency of spur on F(s), how do we finish our design of the loop filter? For a second-order filter, we start from (8.50) and retain only $z_1$, $p_2$, $p_3$. F(s) then becomes

$$F_2(s) = \frac{s + z_1}{(s + p_2)(s + p_3)} \tag{8.66}$$

Previously, we made $p_2 = 0$ to get some preliminary spur reduction. Then $F_2(s)$ becomes

$$F_2(s) = \frac{s + z_1}{s(s + p_3)} \tag{8.67}$$

With this $F_2(s)$, (8.65b) becomes

$$\text{spur}|_{dBc} = 20\log\left[\frac{\omega_u p_3}{\omega_{spur}^2}\,M\,2\pi d\right] \tag{8.68}$$
It should be noted that spur$|_{dBc}$ and $\omega_{spur}$ in (8.68) are given in the spur specification, and d is a mismatch that is given for a given technology. As stated in sub-section 7.3.4.2, d typically is around 0.1%. Hence (8.68) gives an equation that relates the two unknowns: $p_3$, $\omega_u$. We have stated that $p_3$ is related to $\omega_u$ through (8.45). Hence when we look at (8.45) and (8.68), we find that we have two equations and two unknowns: $\omega_u$, $p_3$. With two equations and two unknowns we can solve for $\omega_u$, $p_3$, and then we can go back to (8.43) and solve for $z_1$. $F_2(s)$ is then completely determined. For higher-order filters, we will follow the same argument as given at the end of sub-section 8.3.1.4 and extend the results from $F_2(s)$ to F(s). In summary, this section shows us how to come up with an initial design of F(s) based on spur and stability requirements.
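The two-equation, two-unknown solution described above can be sketched numerically. This is a minimal sketch under stated assumptions, not a design from the text: the function name is hypothetical, the spur is assumed to sit at the 1.728 MHz comparison frequency, and the spur limit (-30 dBc), phase margin (55 degrees), division ratio (M = 1099) and mismatch (d = 0.1%) are borrowed from the DECT example and sub-section 7.3.4.2.

```python
import math

def solve_spur_based(spur_dbc, omega_spur, m_div, d, phase_margin_deg):
    """Solve (8.45) and (8.68) for (omega_u, p3, z1), all in rad/s."""
    # (8.45): PM = atan(4) - atan(omega_u / p3)  ->  p3 = ratio * omega_u
    residual = math.atan(4.0) - math.radians(phase_margin_deg)
    if residual <= 0.0:
        raise ValueError("phase margin must be below atan(4), about 76 degrees")
    ratio = 1.0 / math.tan(residual)
    # (8.68): 10**(spur/20) = omega_u * p3 * M * 2*pi*d / omega_spur**2
    spur_lin = 10.0 ** (spur_dbc / 20.0)
    omega_u = omega_spur * math.sqrt(spur_lin / (ratio * m_div * 2.0 * math.pi * d))
    p3 = ratio * omega_u
    z1 = omega_u / 4.0  # (8.43)
    return omega_u, p3, z1

# Assumed inputs (see lead-in): spur located at the 1.728 MHz reference.
omega_spur = 2.0 * math.pi * 1.728e6
omega_u, p3, z1 = solve_spur_based(-30.0, omega_spur, 1099, 1e-3, 55.0)

# Plug the solution back into (8.68) as a consistency check.
spur_check = 20.0 * math.log10(
    omega_u * p3 * 1099 * 2.0 * math.pi * 1e-3 / omega_spur**2
)
print(f"f_u = {omega_u / (2 * math.pi) / 1e3:.1f} kHz, "
      f"p3/omega_u = {p3 / omega_u:.2f}, spur = {spur_check:.1f} dBc")
```

Under these assumptions the resulting $\omega_u$ is more than ten times below $\omega_{spur}$, which is consistent with the approximation (8.62) used to derive (8.68).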

We  have  now  finished  our  discussions  of  synthesizer  subblocks  and  have  given 
design  procedures  for  all  of  them.  Next  we  put  the  pieces  together  and  see  how  they 
fit  with  each  other. 


8.4    A  Complete  Synthesizer  Design  Example  (DECT 
Application) 

Figure 8.17 is a flowchart that shows how to design a PLL-based frequency synthesizer. First, in the flowchart we choose to design the loop filter based on phase noise; that is, we follow subsection 8.3.1. If we had chosen to design the loop filter based on spur requirements (i.e., sub-section 8.3.2), the flowchart would be different.

Second, the key parameters to design are $K_{vco}$, $K_{pd}$, F(s). Of course, to determine $K_{vco}$, we need to do a detailed design of the VCO, and to determine $K_{pd}$, we need to do a detailed design of the PD/charge pump. In both cases, the W/L ratios and bias currents need to be calculated. As for the loop filter, the R and C values have to be designed. Notice that as opposed to some traditional design methodologies [7] used for discrete circuit design, the present methodology is more suitable for integrating the entire synthesizer on a chip. The major difference is that here we do not assume that we can have individual blocks (such as PD, VCO) available from off-the-shelf components, whose design values for $K_{vco}$, $K_{pd}$ span a large range (because different technologies, such as GaAs, bipolar, and MOS, can be mixed). Instead, we are designing everything in one technology (CMOS) and so the range of $K_{vco}$ and $K_{pd}$ is more restricted. Consequently, some of the steps should follow a specific order (e.g., the VCO should be designed before the loop filter).

Third, as will be shown, during the design, if we are not careful, passive components such as resistors and capacitors can have design values too large to be realizable in integrated form. It is our philosophy to avoid this. In addition, we make sure that the design for transistors is reasonable [e.g., we do not end up with a (W/L) ratio of thousands or much less than one, and the bias current is on the order of milliamperes]. This will give readers the feeling that they are doing a realistic design and allow them to map the design to their application.

Fig. 8.17    Flowchart of synthesizer design procedure

Start
Step 1: Identify key specs: $f_o$, frequency span, division ratio, power, phase noise, stability, spur. Design divider.
Step 2: VCO design: select VCO architecture; design delay cell and $K_{vco}$ to meet $f_o$, span and power specs.
Step 3: Loop filter design 1: $\omega_u$, poles, zeroes. Check whether phase noise and spur specs are met. If no, iterate; if yes, proceed to Step 4.
Step 4: Phase detector design.
Step 5: Loop filter design 2: scaling factor.

We  now  illustrate  this  methodology  by  going  through  an  example  step  by  step, 
using  realistic  numbers  for  a  typical  application.  Consistent  with  previous  chapters, 
we do the design to satisfy the DECT standard. We assume that the technology is a 0.6 µm CMOS process.


8.4.1     Specifications 


For  DECT  application,  we  assume  that  a  direct  conversion  architecture  is  used. 
The  key  specifications  relevant  to  the  frequency  synthesizer  are  summarized  in 
Table  8.1. 
Table 8.1    Specifications of frequency synthesizer for DECT application

a) Power consumption < 33 mW (mostly consumed by VCO) [2]

b) Frequency of operation

i) $f_o$ has $f_{vco\_center}$ = 1.9 GHz and sweeps from 1.884 GHz to 1.9 GHz.
ii) $f_{channel}$ = 1.728 MHz

c) Phase margin > 55 degrees

d) Phase noise

i) Integrated phase noise < -20 dBc
ii) Phase noise at 1 channel offset < -102 dBc/Hz

e) Spur < -30 dBc

Notice that specs (b) and (d) are direct results of the DECT standard. For example, (b) comes from Table 1.1. We assume that simulation is performed according to the discussion in 8.3.1.1, which gives us (d), condition (i); (d), condition (ii), comes from (8.26).

To simplify the discussion the synthesizer is assumed to consist of only the feedback divider with a variable division ratio M as shown in Figure 8.1. Channel selection is therefore achieved by varying the divider ratio M.

Since only a feedback divider is used, the minimum resolution is 1. Since the channel spacing is 1.728 MHz, $f_r$ is equal to the minimum resolution times the channel spacing, and therefore $f_r$ = 1.728 MHz. Since $f_o$ varies from 1.884 GHz to 1.9 GHz, the division ratio will vary from $\frac{1.884\ \mathrm{GHz}}{1.728\ \mathrm{MHz}}$ to $\frac{1.9\ \mathrm{GHz}}{1.728\ \mathrm{MHz}}$, or M = 1090 to 1099. Hence the divider as shown in Figure 7.15 can be used.
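The division-ratio range quoted above is simple arithmetic; a short sketch makes the rounding explicit. The variable names are illustrative, and truncating the ratios to integers is an assumption that matches the quoted range of 1090 to 1099.

```python
# Reference (comparison) frequency equals the channel spacing, Table 8.1 b(ii).
f_ref = 1.728e6                     # Hz
f_lo_min, f_lo_max = 1.884e9, 1.9e9  # Hz, Table 8.1 b(i)

# Integer division ratios; the fractional parts are truncated (assumption).
m_min = int(f_lo_min // f_ref)
m_max = int(f_lo_max // f_ref)

print(f"M ranges from {m_min} to {m_max}")
```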


8.4.2     VCO 

This step depends on which VCO architecture we have selected. Since we are interested in integration (and in minimizing external components such as L and C), we choose the ring oscillator [2] as shown in Figure 7.40. What is N? In Section 7.7, we noted that this depends on $\omega_o$ and technology. In this example we have arbitrarily picked this to be

N   =  4  (8.69) 

To  carry  on  the  analysis,  we  have  to  make  some  assumptions  on  the  internal 
details  of  the  ring  oscillators.  Three  types  of  ring  oscillators  have  been  discussed  in 
sub-section  7.7.1  (each  with  both  single-ended  and  differential  implementations). 
For  the  present  example  we  choose  the  one  whose  delay  cell  was  shown  in  Figure 
7.44.  This  is  redrawn  in  Figure  8.18,  with  A/tune  replaced  by  Mw  and  Mu.  The  P bias 
for  M3,  M4  is  also  shown  explicitly  and  is  set  to  be  a  constant.  Finally,  the  loading 
from  the  next  stage  delay  cell  is  shown.  We  are  now  going  to  design  a  VCO  that 
oscillates  fast  enough  to  achieve  f0  and  so  that  its  Kvco  is  sufficient  to  attain  all 
channels  while  staying  within  the  power  specification  [basically  satisfying  specs 
Fig. 8.18    Slow-slewing saturated delay cell (panels: bias circuit, delay cell, tuning circuit, next delay cell)


(a),  and  (b)  of  Table  8.1].  We  should  of  course  check  whether  this  VCO  generates 
excessive  phase  noise  at  the  synthesizer  output.  This  check  will  be  postponed  until 
after  the  loop  filter  is  designed  in  the  next  section  (section  8.4.3).  If  the  phase  noise 
generated  is  excessive  we  may  have  to  come  back  to  this  section  and  redesign  the 
VCO  (perhaps  by  relaxing  the  power  specification). 

First let us rewrite the formula for $f_o$ from (7.100), where C is relabelled $C_l$.

$$f_o = \frac{1}{2NRC_l\ln 2} \tag{8.70}$$

As shown in (8.70), $f_o$ depends on $C_l$. Now $C_l$ is the loading capacitance on either $V_{out+}$ or $V_{out-}$. Let us consider $C_l$ on $V_{out+}$ only. From Figure 8.18, $C_l$ depends on $M_{15}$, $M_1$, $M_3$.

Next let us repeat the formula for $K_{vco}$. Since the bias for $M_3$, $M_4$ (shown in Figure 7.44) is set to a constant, we can repeat the $K_{vco}$ formula from (7.106).

$$K_{vco} = \frac{-2\pi f_o R\,G_{m7\_8}}{|V_{GS3}| - |V_t| - |V_{DS3}|} \tag{8.71}$$

Here $M_7$-$M_8$ are used for tuning and hence $G_{m7\_8}$ replaces the tuning transconductance of (7.106). Finally, let us write the formula for the power specification. In Figure 8.18, $I_{bias} \gg I_{tune}$. Hence power in the tuning circuitry can be neglected. Furthermore, the bias circuit is shared among all delay cells and hence its power consumed per delay cell can be neglected. Consequently, power for a ring oscillator with N = 4 can be approximated as

$$\text{power} = 4 I_{bias} \times V_{dd} \tag{8.72}$$

This $I_{bias}$ is, in turn, related to the design of $M_3$ by the following formula:

$$V_{swing} = I_{bias} \times R_{M3} \tag{8.73}$$

Here $R_{M3}$ is the resistance of $M_3$, which is biased in the triode region, and $V_{swing}$ is the voltage swing of the delay cell.

To summarize, $f_o$ depends on $M_1$, $M_3$, $M_{15}$. $K_{vco}$ depends on $M_3$, $M_7$. Power depends on $M_3$. There are many methods of attacking such a complicated design task. It seems that starting with the power specification is the simplest, as it depends on only one transistor, $M_3$. Hence this step is subdivided into three parts, where we design $M_3$, then $M_1$, and then $M_7$. To elaborate, we

1. Design the load $M_3$ to satisfy low power requirements (required to be less than 33 mW). Here we assume that the bulk of the PLL power comes from the VCO (although the divider will also consume power, in this simple example we assume it to be negligible compared with the VCO power).

2. Design the $G_m$ of the input differential pair ($M_1$, $M_2$) so as to satisfy the $f_o$ requirement. This makes the VCO oscillate at a DECT center frequency of 1.9 GHz and allows us to determine $C_l$.

3. Design the $G_m$ of the tuning circuit ($M_7$, $M_8$) and hence the $K_{vco}$ to attain the frequency sweep from 1.884 GHz to 1.9 GHz.


8.4.2.1     Design  for  Power  Specification 

Step 1: Determine $I_{bias}$. First, from the power specification, set power consumption to be 33 mW. Assume that all power is consumed in the VCO. Substituting this in (8.72) we have

$$4 \times I_{bias} \times 3.3\ \mathrm{V} = 33\ \mathrm{mW} \tag{8.74a}$$

Solving, we have

$$I_{bias} = 2.5\ \mathrm{mA} \tag{8.74b}$$


Step 2: Calculate $(W/L)_3$. Next we want to determine the W/L ratio of $M_3$. This can be determined from the resistance of $M_3$.

$$R_{M3} = \frac{V_{swing}}{I_{M3\_max}} \tag{8.75}$$

From Figure 4.2, the maximum current flowing through $M_3$, $I_{M3\_max}$, is

$$I_{M3\_max} = I_{bias} = 2.5\ \mathrm{mA} \tag{8.76}$$

This happens when all of $I_{bias}$ is steered to the branch containing $M_3$.

According to [2], $V_{swing}$ should be picked to ensure that the PMOS transistor is deep enough in triode so that $R_{M3}$ is sufficiently linear. We follow the value selected in [2] and choose

$$V_{swing} = 1\ \mathrm{V} \tag{8.77}$$

Substituting (8.76) and (8.77) in (8.75),

$$R_{M3} = \frac{V_{swing}}{I_{bias}} = \frac{1\ \mathrm{V}}{2.5\ \mathrm{mA}} = 400\ \Omega \tag{8.78}$$


If we set $R_{M3}$ equal to the small-signal resistance of $M_3$ in the triode region, then we have

$$R_{M3} = \frac{1}{k_p\left(\frac{W}{L}\right)_3\left(|V_{GS3}| - |V_t| - |V_{DS3}|\right)} \tag{8.79}$$

$V_{GS}$ is made high to favour a high $f_o$ [2]. The upper limit is the power supply voltage. Hence we choose

$$|V_{GS3}| = 3.3\ \mathrm{V} \tag{8.80}$$

Notice that to make the small-signal resistance of $M_3$ as given by (8.79) truly represent the resistance $R_{M3}$, we should bias $M_3$ at the midpoint of the $V_{ds3}$ vs. $I_{d3}$ curve [2]. This is the point where $|V_{DS3}| = V_{swing}/2$. Hence we set

$$|V_{DS3}| = V_{swing}/2 = 0.5\ \mathrm{V} \tag{8.81}$$

Substituting (8.78), (8.80), and (8.81) in (8.79),

$$400\ \Omega = \frac{1}{k_p\left(\frac{W}{L}\right)_3\left(3.3\ \mathrm{V} - V_t - 0.5\ \mathrm{V}\right)} \tag{8.82}$$

Let us assume that our 0.6 µm CMOS process has $k_p = 50\ \mu\mathrm{A/V^2}$ and $V_t = 1\ \mathrm{V}$. Substituting these in (8.82), we find

$$\left(\frac{W}{L}\right)_3 \approx \frac{16.7\ \mu\mathrm{m}}{0.6\ \mu\mathrm{m}} \tag{8.83}$$


8.4.2.2     Design  for  f0 

Next we want to determine $(W/L)_1$. To do this we first have to calculate $C_l$, the load capacitance. Let us substitute condition b(i) in Table 8.1, (8.69), and (8.78) into (8.70):

$$1.9\ \mathrm{GHz} = \frac{1}{2 \times 4 \times 400\ \Omega \times C_l \ln 2} \tag{8.84}$$

Solving,

$$C_l = 0.24\ \mathrm{pF} \tag{8.85}$$

From Figure 8.18, we see that $C_{gs}(M_{15})$ dominates the contribution to $C_l$. Hence

$$C_l = C_{gs}(M_{15}) \tag{8.86}$$

Therefore,

$$C_{gs}(M_{15}) = 0.24\ \mathrm{pF} \tag{8.87}$$

Also, since each delay cell in the oscillator is assumed to be identical, we assume that

$$\left(\frac{W}{L}\right)_{15} = \left(\frac{W}{L}\right)_1 \tag{8.88}$$

$$C_{gs}(M_{15}) = W_{15} \times L_{15} \times C_{ox} = W_1 \times L_1 \times C_{ox} = W_1 \times 0.6\ \mu\mathrm{m} \times C_{ox} \tag{8.88a}$$

For this 0.6 µm process, we assume that

$$C_{ox} = 2.5\ \mathrm{fF}/\mu\mathrm{m}^2 \tag{8.88b}$$

Substituting (8.87) and (8.88b) into (8.88a) we have

$$0.24\ \mathrm{pF} = W_1 \times 0.6\ \mu\mathrm{m} \times 2.5\ \mathrm{fF}/\mu\mathrm{m}^2 \tag{8.89}$$

Solving, we have

$$\left(\frac{W}{L}\right)_1 = \frac{160\ \mu\mathrm{m}}{0.6\ \mu\mathrm{m}} \tag{8.90}$$
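The chain of calculations from the power budget to the input-pair width can be reproduced in a few lines. This is a sketch with illustrative variable names; all numbers are the ones assumed in the text (33 mW, 3.3 V, N = 4, 1 V swing, $k_p$ = 50 µA/V², $V_t$ = 1 V, $C_{ox}$ = 2.5 fF/µm², L = 0.6 µm).

```python
import math

P_MAX = 33e-3      # W, power budget, Table 8.1 a)
VDD = 3.3          # V
N_STAGES = 4       # (8.69)
V_SWING = 1.0      # V, (8.77)
KP = 50e-6         # A/V^2, assumed PMOS process gain
VT = 1.0           # V, assumed threshold magnitude
VGS3 = 3.3         # V, (8.80)
F0 = 1.9e9         # Hz, Table 8.1 b(i)
COX = 2.5e-15      # F/um^2, (8.88b)
L_UM = 0.6         # um, channel length

i_bias = P_MAX / (N_STAGES * VDD)                   # (8.72), (8.74b)
r_m3 = V_SWING / i_bias                             # (8.78)
vds3 = V_SWING / 2.0                                # (8.81)
wl3 = 1.0 / (r_m3 * KP * (VGS3 - VT - vds3))        # from (8.79), (8.82)
c_l = 1.0 / (2 * N_STAGES * r_m3 * F0 * math.log(2))  # from (8.70), (8.84)
w1_um = c_l / (L_UM * COX)                          # from (8.88a), (8.89)

print(f"I_bias = {i_bias * 1e3:.2f} mA, R_M3 = {r_m3:.0f} ohm")
print(f"(W/L)_3 = {wl3:.1f}, C_l = {c_l * 1e12:.3f} pF, W_1 = {w1_um:.0f} um")
```

Without intermediate rounding, $W_1$ comes out slightly below the 160 µm of (8.90), because the text rounds $C_l$ to 0.24 pF before solving (8.89).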


8.4.2.3     Design for $f_{sweep}$, Determine $K_{vco}$

Substituting condition b(i), Table 8.1, in the definition of $K_{vco}$ as given by (7.70),

$$K_{vco}\big|_{\text{in Hz/V}} = \frac{df_o}{dV_{tune}} = \frac{f_{max} - f_{min}}{V_{tune\_min} - V_{tune\_max}} = \frac{1.9\ \mathrm{GHz} - 1.884\ \mathrm{GHz}}{V_{tune\_min} - V_{tune\_max}} \tag{8.91}$$

The tuning range, which equals $V_{tune\_max} - V_{tune\_min}$, is upper bounded by $V_{dd}$ and lower bounded by noise. Following [2], we choose

$$V_{tune\_max} = 1.3\ \mathrm{V} \tag{8.92}$$

and

$$V_{tune\_min} = 0\ \mathrm{V} \tag{8.93}$$

Substituting (8.92) and (8.93) in (8.91), we have

$$K_{vco} = -0.016\ \mathrm{GHz}/1.3\ \mathrm{V} = -12.3\ \mathrm{MHz/V} \tag{8.94}$$

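Let us build a safety margin by setting

$$K_{vco} = -15\ \mathrm{MHz/V} \tag{8.95}$$

Substituting (8.95), (8.80), (8.81), and (8.78) in (8.71) and rearranging, we have

$$G_{m7\_8} = \frac{-15\ \mathrm{MHz/V} \times \left(|V_{GS3}| - |V_t| - |V_{DS3}|\right)}{-f_o R_{M3}} = \frac{-15\ \mathrm{MHz/V} \times (3.3\ \mathrm{V} - 1\ \mathrm{V} - 0.5\ \mathrm{V})}{-1.9\ \mathrm{GHz} \times 400\ \Omega} = 28\ \mu\mathrm{A/V} \tag{8.96}$$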

Referring again to the tuning circuit in Figure 8.18, from the definition of $G_m$ we have

$$I_{tune\_max} - I_{tune\_min} = G_{m7\_8}\left(V_{tune\_max} - V_{tune\_min}\right) \tag{8.97}$$

Now

$$I_{tune\_min} = 0 \tag{8.98}$$

Substituting (8.92), (8.93), (8.96) and (8.98) into (8.97) and solving,

$$I_{tune\_max} = 28\ \mu\mathrm{A/V} \times 1.3\ \mathrm{V} = 36\ \mu\mathrm{A} \tag{8.99}$$

Since $I_{tune\_bias}$ has to be larger than $I_{tune\_max}$, let us set

$$I_{tune\_bias} = 40\ \mu\mathrm{A} \tag{8.100}$$

Next, for any differential pair, its $G_m$ is related to the (W/L) and bias current as:

$$G_{m7\_8} = \sqrt{I_{tune\_bias} \times k_n \times (W/L)_7} \tag{8.101}$$

Given $k_n = 125\ \mu\mathrm{A/V^2}$ for the present process, substitute this value, (8.96), and (8.100) into (8.101). Then

$$28\ \mu\mathrm{A/V} = \sqrt{40\ \mu\mathrm{A} \times 125\ \mu\mathrm{A/V^2} \times (W/L)_7} \tag{8.102}$$
Solving,

$$(W/L)_7 = 0.16 \tag{8.103}$$

This $(W/L)_7$ may be too small. This can be circumvented by making the gain of the current mirror $M_{11}$-$M_{12}$ smaller. We assume that this is being done in the present example and we set a designed $(W/L)_7$ of

$$(W/L)_7 = 1 \tag{8.104}$$

We have now completed our design of the VCO.
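The tuning-circuit numbers from (8.91) through (8.103) can be retraced with a short sketch. Variable names are illustrative, and the transconductance of 28 µA/V is taken as given from (8.96) rather than recomputed.

```python
F_MAX, F_MIN = 1.9e9, 1.884e9        # Hz, Table 8.1 b(i)
V_TUNE_MAX, V_TUNE_MIN = 1.3, 0.0    # V, (8.92), (8.93)
GM_7_8 = 28e-6                       # A/V, value quoted in (8.96), taken as given
I_TUNE_BIAS = 40e-6                  # A, (8.100)
KN = 125e-6                          # A/V^2, assumed NMOS process gain

k_vco = (F_MAX - F_MIN) / (V_TUNE_MIN - V_TUNE_MAX)   # (8.91), (8.94), in Hz/V
i_tune_max = GM_7_8 * (V_TUNE_MAX - V_TUNE_MIN)       # (8.97) with I_tune_min = 0
wl7 = GM_7_8**2 / (I_TUNE_BIAS * KN)                  # (8.101) solved for (W/L)_7

print(f"K_vco = {k_vco / 1e6:.1f} MHz/V")
print(f"I_tune_max = {i_tune_max * 1e6:.1f} uA, (W/L)_7 = {wl7:.2f}")
```

The computed $I_{tune\_max}$ stays below the 40 µA chosen for $I_{tune\_bias}$, as (8.100) requires.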


8.4.3    Loop  Filter:  Part  1 

For our present application, experience shows us that a second-order filter is a good choice, in particular to smooth out voltage jumps during switching transients of the charge pump circuit [4]. Therefore, we use the filter $F_2(s)$ described in (8.11). How do we determine all the parameters of $F_2(s)$? We choose to follow sub-section 8.3.1 to design $F_2(s)$.


8.4.3.1     Determine Crossover Frequency ($\omega_c$) Using Integrated Phase Noise Specification, and Unity Gain Frequency ($\omega_u$)

From condition d(i), Table 8.1, we set integrated phase noise as -20 dBc. We substitute this into (8.36), rewrite the formula in hertz, and we have

$$-20\ \mathrm{dBc} = 2KM^2 f_c \tag{8.105}$$

In addition, using (8.28), we have another equation

$$S_{\theta,\mathrm{vco}}(\omega_i)\big|_{\omega_i = \omega_c} = M^2 S_{\theta,\mathrm{ref}}(\omega_i)\big|_{\omega_i = \omega_c} = KM^2 \tag{8.106}$$

In our present example, we are interested in the ring oscillator, so $S_{\theta,\mathrm{vco}}$ is given by (7.118). Expressing (8.106) in hertz and substituting (7.118), we obtain (8.107).

To obtain the parameters needed to solve (8.107), we do the following: first we calculate $V_{GS1} - V_t$ by substituting $(W/L)_1$ from (8.90) and $I_{bias}$ from (8.74b) in the I-V equation of a SCP:

$$\frac{I_{bias}}{2} = \frac{k_n}{2}\left(\frac{W}{L}\right)_1\left(V_{GS1} - V_t\right)^2 \tag{8.108a}$$

which becomes

$$\frac{2.5\ \mathrm{mA}}{2} = \frac{125\ \mu\mathrm{A/V^2}}{2}\left(\frac{160\ \mu\mathrm{m}}{0.6\ \mu\mathrm{m}}\right)\left(V_{GS1} - V_t\right)^2 \tag{8.108b}$$

Solving, $V_{GS1} - V_t$ = 0.27 V.

Next, from [1], there is a range for $a_v$ and $\xi$. For example, the gain $a_v$ has to be
larger than 1 to ensure that the loop gain is larger than 1 and oscillation is sustained.
We set the values on the high side. Therefore, we set $a_v = 3$ and $\xi = 2.8$. At room
temperature, kT is $4.14 \times 10^{-21}$ J.

Finally, from condition b(i), Table 8.1, $f_0 = 1.9$ GHz, and from (8.74b), $I_{bias} =
2.5$ mA.

Substituting all these values in (8.107), we have

$\frac{258}{f_c^2}\ \text{dBc/Hz} = K M^2$  (8.109)

Now we have two equations, (8.105) and (8.109), and two unknowns, $K$ and $f_c$.
We can then solve for $f_c$:

$f_c = 51.6$ kHz  (8.110)

Also,  from  sub-section  8.4.1  we  obtain  the  maximum  value  of  M: 

M  =    1099  (8.111) 

Substituting  (8.110)  and  (8.111)  into  (8.105),  we  solve  for  K: 

$K = -131$ dBc/Hz  (8.112)

Finally, we substitute (8.110) in (8.35) and we have

$f_u = 51.6$ kHz ($\omega_u = 324$ krad/s)  (8.113)
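As a numerical cross-check of (8.105) and (8.109) through (8.113), the two equations can be solved in a few lines. This is a sketch under the assumption that the -20 dBc integrated phase noise is used as the linear ratio 0.01 and that (8.109) reads K M^2 = 258 / f_c^2; the variable names are illustrative.

```python
import math

INTEGRATED_PN_DBC = -20.0   # condition d(i), Table 8.1
VCO_COEFF = 258.0           # numerator in (8.109): K*M^2 = 258 / f_c^2
M_MAX = 1099                # maximum divider ratio, (8.111)

integrated_pn = 10 ** (INTEGRATED_PN_DBC / 10)   # linear ratio, 0.01

# (8.105): integrated_pn = 2*K*M^2*f_c, with K*M^2 = 258/f_c^2 from (8.109),
# so integrated_pn = 2*258/f_c.
f_c = 2 * VCO_COEFF / integrated_pn
K = VCO_COEFF / (f_c ** 2 * M_MAX ** 2)
K_dbc_hz = 10 * math.log10(K)

# The text takes f_u = f_c when substituting (8.110) into (8.35).
omega_u = 2 * math.pi * f_c

print(f"f_c = {f_c / 1e3:.1f} kHz, K = {K_dbc_hz:.0f} dBc/Hz, "
      f"omega_u = {omega_u / 1e3:.0f} krad/s")
```

The computed values agree with the 51.6 kHz, -131 dBc/Hz, and 324 krad/s given in the text.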


8.4.3.2     Check  Phase  Noise  Specification  Imposed  by  Reciprocal  Mixing 

Now that we have obtained $\omega_u$ of G2(s), have designed a VCO, and have determined
all the parameters in (8.107), we want to check whether the G2(s) so
designed leads to a synthesizer that satisfies condition d(ii), Table 8.1. To do
this we follow the procedure in sub-section 8.3.1.3. First we substitute (8.113) and
condition b(ii), Table 8.1, in (8.39) and note that (8.39) is satisfied. Hence we
follow the procedure in case 2 of sub-section 8.3.1.3.

Next we recall (7.118), the phase noise of the ring oscillator. With the parameters
we have chosen before, this becomes

$S_{\theta\_vco}(f_i) = \frac{258}{f_i^2}\ \text{dBc/Hz}$  (8.115)


From condition b(ii), Table 8.1, we know that $f_{channel} = 1.7$ MHz. Substituting
this in (8.115), we have

$S_{\theta\_vco}(f_{channel}) = -100.5$ dBc/Hz  (8.116)

Substituting this in (8.41), we have

$S_{\theta\_syn}(f_{channel}) = -100.5$ dBc/Hz  (8.117)

Equation (8.117) just meets condition d(ii), Table 8.1. Hence there is no need to
go back to sub-section 8.4.2 to redesign the VCO.
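The value in (8.116) follows directly from evaluating (8.115) at the channel offset. A minimal sketch, assuming the 258/f^2 form of (8.115); the function name is illustrative:

```python
import math

def vco_phase_noise_dbc_hz(f_offset_hz, coeff=258.0):
    """Evaluate 258 / f^2 from (8.115) and express it in dBc/Hz."""
    return 10 * math.log10(coeff / f_offset_hz ** 2)

F_CHANNEL = 1.7e6  # Hz, condition b(ii), Table 8.1
pn_channel = vco_phase_noise_dbc_hz(F_CHANNEL)
print(f"S_vco(f_channel) = {pn_channel:.1f} dBc/Hz")
```

Because the expression falls off as 1/f^2, doubling the offset frequency lowers the result by about 6 dB.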


8.4.3.3     Determine Other Poles and Zeros

We next determine $z_1$ from $\omega_u$ and use condition (c) in Table 8.1 to find $p_3$. This can
be done by following the same procedure as described in sub-section 8.3.1.4. First,
rewriting (8.43), we have

$\frac{\omega_u}{z_1} = 4$  (8.118)

Next let us go back to condition (c) in Table 8.1, and set the phase margin to 58°.
Substituting this in (8.45) and solving, we have

$\frac{p_3}{\omega_u} = 3$  (8.119)

Substituting (8.113) into (8.118), we have

$z_1 = 81$ krad/s  (8.120)

Substituting (8.113) into (8.119), we have

$p_3 = 972$ krad/s  (8.121)

As a side note, dividing (8.14b) by (8.14a), we get

$b = \frac{p_3}{z_1}$  (8.122)

Substituting (8.120) and (8.121) in (8.122), we have

$b = 12$  (8.123)
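The pole and zero placement can be verified numerically. The sketch below assumes the ratios 4 and 3 from (8.118) and (8.119), and it uses the standard phase margin expression for a loop with two poles at the origin, one zero, and one extra pole; that expression is an assumption standing in for (8.45), which is not reproduced in this section.

```python
import math

OMEGA_U = 324e3          # rad/s, from (8.113)

z1 = OMEGA_U / 4         # (8.118): omega_u / z1 = 4
p3 = 3 * OMEGA_U         # (8.119): p3 / omega_u = 3
b = p3 / z1              # (8.122)

# Assumed open-loop shape: G(s) ~ (s + z1) / (s^2 (s + p3)).
# Phase at omega_u is atan(omega_u/z1) - 180 deg - atan(omega_u/p3),
# so the margin to -180 deg is:
phase_margin_deg = math.degrees(
    math.atan(OMEGA_U / z1) - math.atan(OMEGA_U / p3)
)

print(f"z1 = {z1 / 1e3:.0f} krad/s, p3 = {p3 / 1e3:.0f} krad/s, b = {b:.0f}")
print(f"phase margin = {phase_margin_deg:.1f} deg")
```

With these ratios the computed margin is within about half a degree of the 58° target from condition (c).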

8.4.3.4     Check  Spurs  Specification 

Now that we have obtained $\omega_u$ and all poles and zeroes of G2(s), we want to go back
and check if our design satisfies condition (e), Table 8.1. To do this we make use of
the discussion offered in sub-section 8.3.2 to find the output spurs. As discussed in
sub-section 8.3.2, we have assumed that the single most important spurious source
is from the PD. We jump ahead a bit to sub-section 8.4.4 and learn that we will be
using a PFD for our PD. Hence $\omega_{spur} = \omega_r = 2\pi \times 1.728$ Mrad/s.

To find the output spur for our present design, we use (8.68), repeated here:

$\text{spur}\big|_{dBc} = 20\log\left(\frac{\omega_u p_3}{\omega_{spur}^2}\, M\, 2\pi d\right)$  (8.124)


For the present technology we assume that we are given a mismatch that leads to

$d = 0.001$  (8.125)

Substituting (8.113), (8.111), (8.121), (8.125), and $\omega_{spur} = 2\pi \times 1.728$ Mrad/s into
(8.124), we have

$\text{spur}\big|_{dBc} = 20\log\left(\frac{324\ \text{krad/s} \times 972\ \text{krad/s} \times 1099 \times 2\pi \times 0.001}{(10.9\ \text{Mrad/s})^2}\right) = -35$ dBc  (8.126)

This  satisfies  condition  (e)  of  Table  8.1. 
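The spur estimate can be reproduced with a few lines of arithmetic. This sketch assumes (8.124) has the form 20 log(omega_u p3 M 2 pi d / omega_spur^2), which is consistent with the numbers substituted in (8.126); the function and argument names are illustrative.

```python
import math

def spur_dbc(omega_u, p3, m, d, omega_spur):
    """Reference spur level per the assumed form of (8.124), in dBc."""
    return 20 * math.log10(omega_u * p3 * m * 2 * math.pi * d / omega_spur ** 2)

spur = spur_dbc(
    omega_u=324e3,                       # rad/s, (8.113)
    p3=972e3,                            # rad/s, (8.121)
    m=1099,                              # (8.111)
    d=0.001,                             # mismatch duty cycle, (8.125)
    omega_spur=2 * math.pi * 1.728e6,    # rad/s, PFD reference frequency
)
print(f"spur = {spur:.1f} dBc")
```

The result lands within a fraction of a dB of the -35 dBc quoted in (8.126). Since the spur scales with omega_u times p3, narrowing the loop is the direct lever if condition (e) were not met.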

At this point in the design we have finished picking all the poles and zeroes as
well as the $\omega_u$ of G2(s) that allow the synthesizer to meet the specifications. How
does this information allow us to determine F2(s)? Referring to (8.11a), to determine
F2 we need to find $z_1$, $p_3$, and $C_3$. To obtain these values we recognize that $z_1$ and $p_3$ of F2
are the same as $z_1$ and $p_3$ of G2, which have already been determined.

We  ask  ourselves,  have  we  finished  designing  the  loop  filter  F2(s)  since  we  have 
determined  all  its  poles  and  zeroes?  The  answer  is  no.  This  is  because  F2(s)  takes 
current  as  the  input  and  generates  voltage  as  the  output.  Hence,  besides  poles  and 
zeroes,  this  transfer  function  needs  a  term  that  transforms  current  to  voltage.  When 
F2(s)  is  expressed  in  the  form  given  in  (8.13),  it  can  be  seen  that  R2  is  this  term, 
which  we  denote  as  the  scaling  factor.  When  F2(s)  is  expressed  in  the  form  used  in 
this  section,  namely  the  form  given  in  (8.11a),  C3,  instead  of  R2,  becomes  the 
scaling  factor. 

To determine this C3 we recognize that C3 in the F2 expression is the same as the
C3 in the G2 expression as given in (8.15). To determine C3 in the G2 expression we
must first find all the other unknowns in G2. It turns out that the only unknown is Kpd,
and hence we first determine Kpd. Then we set $s = j\omega_u$ in (8.15). From the definition,
$|G_2(j\omega_u)| = 1$. Hence we have one equation and one unknown, C3, and it can be
determined. This is what we do next.


8.4.4    Phase  Detector:  Determine  Kpd 

In this step we design the PD and use condition d(ii), Table 8.1, to determine Kpd. For
the PD, we decide to use the PFD/charge pump as described in Figure 7.4 (the C in the
loop filter is now replaced by the filter F2(s), shown in Figure 8.5). How do we
determine its required Kpd? There are a few alternatives. We choose to resort to
phase noise considerations to obtain this value [12]. Remember that back in
sub-section 8.3.1.2 we ignored phase noise from the PD. Let us reintroduce this noise
source in the present sub-section. From the discussion following (7.20), we
concluded that Kpd should be selected large enough to render the phase noise
contribution from the charge pump at one channel offset insignificant when compared
with other sources. Hence if the synthesizer phase noise due to the PD's phase noise
alone satisfies condition d(ii), Table 8.1, the Kpd selected must be large enough.

From (7.20), to obtain this phase noise, we need $H_{ref}$. What approximation
should we use for this $H_{ref}$ at $\omega_i \gg \omega_u$? If we use the first-order approximation as
given in (8.34), then since $f_{channel}$ (= 1.7 MHz) is much larger than $f_u$ (= 51.6 kHz),
$H_{ref}(f_{channel})$ becomes 0. This will make $S_{\theta o\_pd}$ in (7.20) equal to 0, and (7.20) will
be useless in helping us to design Kpd. Instead we should use the second-order
approximation to $H_{ref}$ as given in (8.51). With this second-order approximation, we
have derived the resulting $H_{ref}$, which is given in (8.58). Substituting this $H_{ref}$ from
(8.58) into (7.20), and taking the logarithm on both sides, we have

$S_{\theta o\_pd}(\omega_i)\big|_{dBc/Hz} = 10\log\left[\left(\frac{\omega_u p_3 \cdots p_n M}{\omega_{channel}^{\,n-1} K_{pd}}\right)^2 \frac{i_{nd}^2}{\Delta f}\right]$  (8.127)


From Chapter 7, $i_{nd}^2/\Delta f$ is the PSD of the noise current from M1 of the charge pump
in Figure 7.4 (reduced by a factor of d, the duty cycle, since the charge pump is
only on for that period during steady state), and $\omega_{channel}$ is the offset frequency at
which we calculate $S_{\theta o\_pd}$. For the present design example, F(s) is a second-
order filter. Hence (8.127) becomes

$S_{\theta o\_pd}(\omega_i)\big|_{dBc/Hz} = 10\log\left[\left(\frac{\omega_u p_3 M}{\omega_{channel}^2 K_{pd}}\right)^2 \frac{i_{nd}^2}{\Delta f}\right]$  (8.128)


How do we design Kpd to satisfy this? We assume that in the present derivation,
the PD is the only noise source that contributes phase noise to the synthesizer phase
noise. Therefore, $S_{\theta\_syn}(\omega_i) = S_{\theta o\_pd}(\omega_i)$. First, from condition d(ii) in Table 8.1,
$S_{\theta\_syn}$ must be less than -102 dBc/Hz at $f_i = f_{channel}$ (= 1.7 MHz); therefore, $S_{\theta o\_pd}$
must also be less than -102 dBc/Hz at $f_i = f_{channel}$ (= 1.7 MHz). Second, we know
that

$\frac{i_{nd}^2}{\Delta f} = \frac{i_{eq}^2}{\Delta f}\, d$  (8.129)

Here $i_{eq}^2/\Delta f$ is the PSD of the noise current of M1 in Figure 7.4. Assuming
thermal noise dominates, $i_{eq}^2/\Delta f$ is given by

$\frac{i_{eq}^2}{\Delta f} = 4kT\,\frac{2}{3}\, g_{m1}$  (8.130)


By definition,

$g_{m1} = \frac{2 I_{charge\_pump}}{V_{GS1}-V_t} = \frac{2 \times 2\pi K_{pd}}{V_{GS1}-V_t}$  (8.131)

where $g_{m1}$, $V_{GS1}$ are the $g_m$, $V_{GS}$ of M1 in Figure 7.4.
Let us arbitrarily set

$V_{GS1} - V_t = 1$ V  (8.132)

Since $S_{\theta o\_pd}$ has a complicated dependency on Kpd, rather than finding the
explicit formula of $S_{\theta o\_pd}$ in terms of Kpd, we decide to find by trial and error the
Kpd that results in a satisfactory $S_{\theta o\_pd}$. We set a trial Kpd as:

$K_{pd} = 10\ \mu$A/rad  (8.133)

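We now substitute (8.132) and (8.133) in (8.131) and solve for $g_{m1}$. Substituting
this $g_{m1}$ in (8.130), we solve for $i_{eq}^2/\Delta f$. Substituting this
$i_{eq}^2/\Delta f$ and (8.125) in (8.129), we find $i_{nd}^2/\Delta f$:

$\frac{i_{nd}^2}{\Delta f} = 4 \times 4.14\times10^{-21}\ \text{J} \times \frac{2}{3} \times \frac{2 \times 2\pi \times 10\ \mu\text{A/rad}}{1\ \text{V}} \times 0.001 = (0.037\ \text{pA})^2/(\text{rad/s}) = (0.015\ \text{pA})^2/\text{Hz}$  (8.134)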

Substituting (8.134), (8.113), (8.121), (8.111), (8.133), and $\omega_{channel} = 2\pi \times 1.7$
Mrad/s into (8.128), we obtain




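$S_{\theta o\_pd}(1.7\ \text{MHz}) = 10\log\left[\left(\frac{51.6\text{k}}{1.7\text{M}} \cdot \frac{51.6\text{k} \times 3}{1.7\text{M}} \cdot \frac{1099}{10\ \mu\text{A/rad}}\right)^2 \frac{(0.015\ \text{pA})^2}{\text{Hz}}\right] = 10\log\{[2\times10^{-17}]\ \text{rad}^2/\text{Hz}\} = -166$ dBc/Hz

Therefore, $S_{\theta\_syn}$(1.7 MHz) is also -166 dBc/Hz. This is smaller than -102 dBc/Hz
and hence condition d(ii) in Table 8.1 is satisfied. Accordingly, a Kpd of 10 µA/rad
gives us an acceptable design.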
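The trial-and-error step for Kpd lends itself to a small script, so that other trial values can be tested quickly. The sketch below is a reconstruction under stated assumptions: g_m1 is taken as 2 (2 pi Kpd) / (V_GS1 - V_t), the thermal noise PSD as 4kT (2/3) g_m1, and the per-(rad/s) value is divided by 2 pi to obtain the per-Hz value. That last conversion is inferred from the two numbers quoted in (8.134) rather than stated in the text.

```python
import math

KT = 4.14e-21        # J, room temperature
K_PD = 10e-6         # A/rad, trial value (8.133)
V_OV = 1.0           # V, V_GS1 - V_t from (8.132)
D = 0.001            # duty cycle, (8.125)
F_U = 51.6e3         # Hz, (8.113)
F_CHANNEL = 1.7e6    # Hz, channel offset
M = 1099             # (8.111)
P3_OVER_FU = 3       # (8.119)

g_m1 = 2 * (2 * math.pi * K_PD) / V_OV           # assumed form of (8.131)
i_eq_sq = 4 * KT * (2 / 3) * g_m1                # (8.130)
i_nd_sq_per_rad = i_eq_sq * D                    # (8.129)
i_nd_sq_per_hz = i_nd_sq_per_rad / (2 * math.pi) # conversion inferred from (8.134)

# Bracketed gain term of (8.128), written in hertz as in the worked numbers.
gain = (F_U / F_CHANNEL) * (P3_OVER_FU * F_U / F_CHANNEL) * M / K_PD
s_pd_dbc_hz = 10 * math.log10(gain ** 2 * i_nd_sq_per_hz)

print(f"i_nd = {math.sqrt(i_nd_sq_per_hz) * 1e12:.3f} pA/sqrt(Hz)")
print(f"S_pd(1.7 MHz) = {s_pd_dbc_hz:.1f} dBc/Hz")
```

Under these assumptions the noise current comes out at about 0.015 pA per root hertz and the phase noise at about -167 dBc/Hz, in line with the text's figure and far below the -102 dBc/Hz limit.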


8.4.5    Loop  Filter:  Part  2 

In this sub-section

1.  We substitute Kpd in the expression for $G(j\omega)$. Then we use the criterion $|G(j\omega_u)|
= 1$ to determine the scaling factor C3. After that we use the poles and zeroes to
determine values for C and R2.

2.  Finally we check the values of R2, C, C3 to see if they are realizable.


8.4.5.1     Determine  the  Scaling  Factor 

We turn back to the expression of F2(s), rewritten from (8.11a) as

$F_2(s) = \frac{s + z_1}{s C_3 (s + p_3)}$  (8.135)

We can now find C3 by using (8.15). First, from the definition,

$|G_2(j\omega_u)| = 1$  (8.136)

Then, substituting (8.15) in (8.136), we have

$|G_2(j\omega_u)| = \left|\frac{K_{pd} K_{vco} (j\omega_u + z_1)}{M (j\omega_u)^2 C_3 (j\omega_u + p_3)}\right| = 1$  (8.137)

Rearranging (8.137), we have

$C_3 = \frac{K_{pd} K_{vco}\, |j\omega_u + z_1|}{M \omega_u^2\, |j\omega_u + p_3|}$  (8.138)

Substituting (8.133), (8.95), (8.113), (8.120), (8.111), and (8.121) into (8.138)
and solving, we have
$C_3 = 2.7$ pF  (8.139)

Substituting (8.123) and (8.139) into (8.12b), we have

$12 = 1 + \frac{C}{2.7\ \text{pF}}$  (8.140)

Solving, we have

$C = 29.7$ pF  (8.141)

Finally, substituting (8.120) and (8.141) in (8.11b), we have

$81\ \text{krad/s} = \frac{1}{R_2 (29.7\ \text{pF})}$

Solving,

$R_2 = 410\ \text{k}\Omega$  (8.142)

We have finished determining the component values.
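The component values can be recomputed from the loop parameters as a sanity check. This sketch assumes that (8.138) has the form C3 = Kpd Kvco |j omega_u + z1| / (M omega_u^2 |j omega_u + p3|), that the magnitude of Kvco (15 MHz/V) is converted to rad/s/V with a factor of 2 pi, and that (8.12b) and (8.11b) read b = 1 + C/C3 and z1 = 1/(R2 C). Variable names are illustrative.

```python
import math

K_PD = 10e-6                    # A/rad, (8.133)
K_VCO = 2 * math.pi * 15e6      # rad/s/V, magnitude of the -15 MHz/V VCO gain
OMEGA_U = 324e3                 # rad/s, (8.113)
Z1 = 81e3                       # rad/s, (8.120)
P3 = 972e3                      # rad/s, (8.121)
M = 1099                        # (8.111)

# Unity-gain condition |G2(j*omega_u)| = 1 solved for C3.
c3 = (K_PD * K_VCO * abs(complex(Z1, OMEGA_U))
      / (M * OMEGA_U ** 2 * abs(complex(P3, OMEGA_U))))

b = P3 / Z1                     # (8.122)
c = (b - 1) * c3                # from b = 1 + C/C3
r2 = 1 / (Z1 * c)               # from z1 = 1/(R2*C)

print(f"C3 = {c3 * 1e12:.2f} pF, C = {c * 1e12:.1f} pF, R2 = {r2 / 1e3:.0f} kOhm")
```

The unrounded results differ from the quoted values by a few percent because the text rounds C3 to 2.7 pF before computing C and R2.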

8.4.5.2     Realizability

R2,  C,  C3  are  bounded  by  realizability  in  a  CMOS  process.  Remember  that  in 
integrated  circuits,  we  can  only  realize  small  resistors  and  capacitors,  because  they 
take  up  chip  area.  Certainly,  capacitors  on  the  order  of  microfarads  and  resistors  on 
the  order  of  megaohms  are  not  acceptable. 

To check if our present design meets these considerations, first we assume that
for our 0.6 µm CMOS technology, which has a double polysilicon capacitor
available,

$\text{cap/area} = 2\ \text{fF}/\mu\text{m}^2$  (8.143)

Hence from (8.141) and (8.143),

$\text{area of } C = \frac{29.7\ \text{pF}}{2\ \text{fF}/\mu\text{m}^2} = 1.48 \times 10^4\ \mu\text{m}^2 = 121\ \mu\text{m} \times 121\ \mu\text{m}$  (8.144)

Even though this is large, it is still within a reasonable fabrication limit. C3 is
much smaller than C and hence will be within the fabrication limit as well. For the
resistor we assume that the resistivity of this process is given by

$\text{resistance/area} = 10\ \Omega/\mu\text{m}^2$  (8.145)

From (8.142) and (8.145), we have

$\text{area of } R_2 = \frac{410\ \text{k}\Omega}{10\ \Omega/\mu\text{m}^2} = 4.1 \times 10^4\ \mu\text{m}^2 = 202\ \mu\text{m} \times 202\ \mu\text{m}$  (8.146)

Again, even though this is large, it is also within a reasonable fabrication limit.

Hence the components meet the realizability check.
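The area estimates are simple ratios, and a short sketch makes it easy to repeat the check for other component values. It uses the per-area figures assumed in (8.143) and (8.145) and follows the text's convention of dividing the resistance by a resistance-per-area figure; the helper name is illustrative.

```python
import math

CAP_PER_AREA = 2e-15     # F per um^2, (8.143)
RES_PER_AREA = 10.0      # ohm per um^2, (8.145), the text's convention

def square_side_um(area_um2):
    """Side length of a square layout with the given area, in um."""
    return math.sqrt(area_um2)

area_c = 29.7e-12 / CAP_PER_AREA    # um^2, for C from (8.141)
area_r2 = 410e3 / RES_PER_AREA      # um^2, for R2 from (8.142)

print(f"C:  {area_c:.3g} um^2, about {square_side_um(area_c):.0f} um per side")
print(f"R2: {area_r2:.3g} um^2, about {square_side_um(area_r2):.0f} um per side")
```

Both sides come out at roughly 120 µm and 200 µm, matching (8.144) and (8.146).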

In summary, in Section 8.4 we have designed a loop filter with C =
29.7 pF, C3 = 2.7 pF, R2 = 410 kΩ. We have also determined that the PD should
have a Kpd = 10 µA/rad. Together with the VCO we have designed, which has a
Kvco = -15 MHz/V, we have completed the synthesizer design.


8.5     Implementation  of  a  Frequency  Synthesizer 
with  a  Fractional  Divider 

As  a  practical  example  of  a  complete  synthesizer,  a  2.4-GHz  frequency  synthesizer 
was  designed  that  uses  a  fractional  divider  to  drive  a  dual-phase-locked-loop  (PLL) 
structure,  with  both  PLLs  using  only  on-chip  ring  oscillators  [14].  The  first-stage 
narrow-band  PLL  acts  as  a  spur  filter  while  the  second  stage  wide-band  PLL 
suppresses  VCO  phase  noise  so  that  simultaneous  suppression  of  phase  noise  and 
spur  is  achieved.  A  new  low-power,  low-noise,  low-frequency  ring  oscillator  is 
designed for this narrow-band PLL. The chip was designed in 0.35 µm CMOS
technology and achieves a phase noise of -97 dBc/Hz at 1-MHz offset and spurs of
-55 dBc. The chip's output frequency varies from 2.4 to 2.5 GHz; the chip consumes
15 mA from a 3.3-V supply and occupies 3.7 sq mm.


8.5.1    Architecture 

8.5.1.1     Functional  Description 

The proposed architecture is shown in Figure 8.19. It starts with a fractional divider
(FD), which is used to generate the reference frequencies of the channels. By the
nature of the FD, these reference frequencies contain spurs. To remove or suppress
these spurious tones, a variable center frequency filter implemented using a PLL is
adopted. This PLL uses a narrow-band loop filter so that it has a sharp frequency
response and thus can remove spurs from these reference frequencies. This PLL is
denoted as the narrow-band PLL (NBPLL) and its VCO is denoted as the
narrow-band VCO (NBVCO). Its output is then used to drive a high-frequency PLL. Since
the required resolution of the overall synthesizer can be achieved via the FD, this
PLL uses only an integer divider.

Because the reference frequencies of this PLL are at much higher frequency than
the channel bandwidth (BW), spurious tones from its phase detector are of no
concern. Thus, the loop BW can be made very wide to suppress its VCO phase
noise. This PLL is denoted as the wide-band PLL (WBPLL) and its VCO the
wide-band VCO (WBVCO).

Fig. 8.19    Block diagram of 2.4 GHz frequency synthesizer showing frequency planning

The fractional divider, FD, is implemented using just an accumulator to
calculate phase. The FD output is:

$f_{FD} = f_{in}\frac{K}{N}$  (8.147)

The spurs from the FD are at

$f_{spur} = f_{in}\frac{GCF(K, N)}{N} \overset{\text{worst case}}{=} f_{in}\frac{1}{N}$  (8.148)

K is the control word of the accumulator, $f_{in}$ is the input frequency, and N is the
accumulator size. GCF is the greatest common factor, and for the worst-case spur it is 1.
The FD output frequency is multiplied by the divider ratio M in the wideband PLL
to get the overall output frequency.


$f_{WBPLL} = f_{in}\frac{K}{N} M = 2.4$ GHz  (8.149)

A small M is desirable to keep the referred noise at the input of the wideband
PLL from being greatly multiplied. Its lower limit is set by $f_{FD}$ and hence the
maximum speed at which the fractional divider can operate. In this design it is set to 16.
From (8.149), when K is changed by 1, $f_{WBPLL}$ should change by the minimum
channel resolution (e.g., 1 MHz for Bluetooth). This results in

$1\ \text{MHz} = \frac{f_{in}}{N} \times 16$  (8.150)


To reduce spur amplitude, $f_{in}$ should be increased, and it is ultimately limited by the
speed of the digital hardware. It is arbitrarily set to 256 MHz. Applying this to
(8.150), (8.149), (8.148) gives N = 4096, K = 2400-2483, $f_{FD}$ = 150 MHz,
$f_{spur}$ = 62.5 kHz. This FD output is fed into the narrowband PLL, which produces a filtered
output at approximately 150 MHz ($f_{NBPLL}$). The narrowband PLL has a loop
bandwidth of 2 kHz and cleans up the spurs from the FD at multiples of 62.5 kHz. The
narrowband PLL output of 150 MHz is fed into the wideband PLL with a loop
bandwidth of 10 MHz to suppress the phase noise from the high frequency ring
oscillator. The spur frequency from the phase detector and charge pump is at a
relatively high frequency of 150 MHz and is not of major concern. Representative
spectra, which highlight the spurs and phase noise at various nodes of the
architecture, are shown in Figure 8.20.
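The frequency plan of (8.147) through (8.150) can be tabulated with a short script. This is a sketch using the values stated above (256 MHz input, divider ratio 16, 1 MHz channel step); the function names are illustrative and `math.gcd` plays the role of GCF.

```python
from math import gcd

F_IN = 256e6          # Hz, FD input frequency
M_DIV = 16            # integer divider ratio of the wideband PLL
CHANNEL_STEP = 1e6    # Hz, minimum channel resolution

# (8.150): CHANNEL_STEP = (F_IN / N) * M_DIV, solved for the accumulator size.
N = round(F_IN * M_DIV / CHANNEL_STEP)

def fd_output(k):
    """FD output frequency for control word k, per (8.147)."""
    return F_IN * k / N

def fd_spur(k):
    """FD spur frequency for control word k, per (8.148)."""
    return F_IN * gcd(k, N) / N

def synth_output(k):
    """Overall synthesizer output frequency, per (8.149)."""
    return fd_output(k) * M_DIV

for k in (2400, 2401, 2483):
    print(f"K={k}: f_FD={fd_output(k) / 1e6:.4f} MHz, "
          f"f_out={synth_output(k) / 1e6:.0f} MHz, "
          f"f_spur={fd_spur(k) / 1e3:.1f} kHz")
```

Control words that share factors with N push the spur to a higher frequency; the worst case, GCF = 1, gives the 62.5 kHz spacing that the narrowband PLL must filter.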


8.5.1.2     Architectural  Advantage 

The  major  advantage  of  this  architecture  is  that  the  phase  noise  of  the  low  frequency 
ring  oscillator  in  the  narrowband  PLL  dominates  the  phase  noise  of  the  reference 
signal  to  the  wideband  PLL,  and  hence  the  overall  frequency  synthesizer  phase 
noise.  Since  the  low  frequency  ring  oscillator  has  lower  phase  noise  than  the  high 
frequency ring oscillator, the overall synthesizer phase noise is reduced.
Specifically, the lower frequency oscillator has lower phase noise because of the following
features:

Saturation of Delay Cell Is Always Achieved and No Phase Noise Due to Cycle-to-Cycle
Correlation

The  output  of  the  individual  delay  cell  in  this  low  frequency  oscillator  always 
reaches  the  power  supply  before  the  end  of  a  cycle  and  therefore  is  called  a 
saturated  ring  oscillator.  In  comparison  the  output  of  the  individual  delay  cell  in 
the  high  frequency  oscillator  is  never  saturated  and  all  the  transistors  are  on  all  the 
time.  Because  the  low  frequency  VCO  delay  cell  is  saturated,  some  of  the 
transistors  are  turned  off  for  some  time  during  each  period.  Thus  there  is  no  cycle 
to  cycle  correlation.  As  discussed  in  Chapter  7,  this  means  that  phase  noise  of  a 
saturated  ring  oscillator  is  lower  than  that  of  the  unsaturated  ring  oscillator,  even 
when  all  the  design  efforts  to  reduce  cycle  to  cycle  correlation  in  the  unsaturated 
ring  oscillator,  as  discussed  in  Chapter  7,  have  been  applied.  Therefore  the  low 
frequency  VCO  has  lower  absolute  phase  jitter,  leading  to  a  lower  phase  noise  of 
the  entire  frequency  synthesizer.  Note  that  since  absolute  jitter,  rather  than  relative 
jitter,  is  the  proper  performance  metric,  this  advantage  of  the  low  frequency  VCO 
over  the  high  frequency  VCO  remains  even  after  we  multiply  the  phase  noise  of  the 
low  frequency  VCO  in  the  narrowband  PLL  by  the  divider  ratio  M  (=16). 


Higher  Slew  Rate  Because  of  Negligible  Displacement  Current 

At high frequency, parasitic capacitance becomes a significant factor in
the performance of the circuit. For the present design, where the delay cell is
implemented using a source coupled pair (SCP), the impact of the parasitic capacitance
at the common source node, $C_s$ ($C_{sb}$ of the diff pair and $C_{gd}$ and $C_{db}$ of the tail
current mirror), of a conventional differential delay cell based ring oscillator is of
particular importance. During every cycle of oscillation, the common source node
of the differential pair undergoes significant perturbations. Since the
common source voltage has to shift up every half cycle to completely turn off one
side of the differential pair, the source node fluctuates at two times
the frequency of the input to the differential pair. At the point of transition (i.e.,
when the differential input voltage is zero), the sensitivity of the delay cell to noise is the
greatest since this is the onset of the rising/falling transition. At this time,
the source node is at approximately its lowest potential (the parasitic capacitance $C_s$
is discharging). This implies that the current going through the differential branches
is smaller, which in turn means less slew current is available to charge up the load
capacitance. Since phase noise is inversely proportional to the amount of current
available to charge up the load capacitance, the phase noise is
impacted negatively by the above phenomenon. Thus in general, we want to
minimize the displacement current that is spent on charging and discharging
$C_s$ every half cycle. As frequency decreases, this displacement current becomes
smaller. Therefore the phase noise of the low frequency oscillator is lower.
Fig. 8.21    New delay cell schematic


8.5.1.3     Circuit  Implementation 


The  phase  frequency  detector  (PFD),  charge  pump,  dividers  and  high  frequency  VCO 
with  replica  bias  are  implemented  in  a  differential  fashion  using  traditional  designs, 
like  that  covered  in  Chapter  7.  The  high  frequency  VCO,  being  unsaturated,  should 
be  designed  according  to  Chapter  7  to  reduce  cycle  to  cycle  correlation.  On  the  other 
hand,  the  150  MHz  low  frequency  VCO  needs  to  remain  saturated.  Adding  more 
capacitance  or  using  more  stages  to  make  it  run  at  lower  frequency  while  the  VCO 
remains  saturated  results  in  poorer  phase  noise  or  more  power  consumption.  A  new 
delay  cell  is  proposed  to  solve  this  problem.  Figure  8.21  shows  such  a  delay  cell  for 
the  3-stage  low  frequency  ring  oscillator.  In  this  new  delay  cell,  a  cross  coupled 
circuit  (M3,  M4)  is  added  to  the  traditional  differential  delay  cell.  It  should  be  noted 
that  this  cross  coupled  circuit  is  not  operated  in  a  positive  feedback  mode,  as  in 
traditional  cross  coupled  circuit.  Rather,  this  new  addition  allows  the  delay  cell  to 
operate  at  lower  frequency  while  maintaining  a  high  slew  rate.  It  does  so  by  stealing 
the  current  available  to  charge/discharge  the  capacitors. 

We will explain how the circuit works on a low to high transition at $V_{out-}$. At the
beginning of this transition, M4 is in the saturation region and provides constant
current, because $V_{gs4}$ has not changed. When $V_{gs1}$ starts to decrease, $I_{d1}$ starts to
decrease as well, leading to an increase in $I_c$. As shown in Figure 8.22, this causes
the low to high transition. However, the rate of transition is slow because only $I_{d1}$ is
reduced while $I_{d4}$ and $I_{d5}$ remain relatively constant. Hence we introduce a slow
phase in the oscillation period which starts at $t_0$ and ends at $t_{slow}$. To be precise, $t_{slow}$
occurs when $V_{out+}$ starts to decrease, as shown in Figure 8.22. The slow phase
persists until $I_{d4}$ starts to decrease when $V_{gs4}$ decreases. When $I_{d4}$ starts to decrease,
the slew rate returns to normal and we enter the slew phase. Because the oscillation
period is now a sum of the slow phase and slew phase, the oscillation slows down.
Thus, the circuit achieves the desired effect in that it slows down the oscillation
without adding extra loading elements and without slowing down the slew rate at the
actual crossing. Effectively the added cross coupled circuit senses the voltage at
the output, then steals or returns the current at different times during the transition,
therefore causing the slow phase and slew phase. The two different phases can be
seen in Figure 8.22. Since this is a saturated ring oscillator, all the phase noise design
considerations covered in Chapter 7 can be applied to the present case.

Fig. 8.22    New VCO output waveform
8.6     Problems

8.1  Explain  qualitatively  that  the  PLL  behaves  like  a  bandpass  filter  even  though  it 
only  has  a  lowpass  filter  inside. 

8.2  Explain  qualitatively  that  in  a  frequency  synthesizer,  when  we  want  to  select  a 
new  channel  via  changing  the  divider  ratio,  initially  the  loop  will  not  track,  but 
eventually  it  will  track. 

8.3  We stated in sub-section 8.3.1.2 that the transfer function $H_{vco}(\omega_i)$ is a high-
pass function. When expressed in $\omega$, $H_{vco}(\omega)$ is actually a band reject function
centered at $\omega_0$. Explain how this transformation from high pass to band reject
function occurs. Also explain why we can still use the concept of transfer
function (which is valid only for a linear time invariant (LTI) system) when the
PLL is actually a linear periodic time varying (LPTV) system.

8.4  Suppose that in the design example in Section 8.4 we tighten our power specs
and we try to meet these new power specs by going through sub-section 8.4.2.1
again with a reduced $I_{bias}$.

(a)  What happens to the phase noise of the ring oscillator when we reduce $I_{bias}$?

(b)  If originally both the phase noise and spur specs are met, can we be sure
they are still met now?

(c)  In Section 8.4 we designed the loop filter based on phase noise (that is,
following the approach in sub-section 8.3.1). Let us now go back to Section
8.4 and change the design approach: we choose to design the loop filter
based on spur (that is, following the approach in sub-section 8.3.2). Using
this new approach, answer (b) again.

Fig. P8.1 (noise source $V_n = \sqrt{4kTBR}$)
8.5  In Section 8.4, what potential problem will we encounter if we swap
sub-section 8.4.2 and sub-section 8.4.3 (i.e., we design the loop filter before we
design the VCO)?

8.6  In sub-section 8.4.4, we derived the formula for $S_{\theta o\_pd}$, the phase noise
contribution from the phase detector [see (8.127)]. Derive a similar formula for
$S_{\theta o\_R2}$, the phase noise contribution from resistor R2 in the loop filter used in
Section 8.4 (assume a large b).

8.7  Figure P8.1 shows a loop filter whose output is taken between R2 and C3. This is
to replace the loop filter used in Section 8.4.
We are interested in the thermal noise contribution from resistor R2 to the phase
noise at the output of the synthesizer. Let us assume that the noise from R2 is
modeled as shown in Figure P8.1, where it is represented by a voltage source
whose Laplace transform is $V_n(s)$ and has the same amplitude ($= \sqrt{4kTBR_2}$) as
the equivalent input-referred voltage noise source. Here B is the bandwidth of
interest.

(a)  Derive $H_{R_2}$ ($= \theta_o(s)/V_n(s)$), the transfer function from the resistor R2 to the
output of the synthesizer. The transfer function should be expressed as a
function of Kvco, R2, C3, B, and other relevant parameters. Then use $H_{R_2}$
to derive $\theta_o(s)$ as a function of $\sqrt{4kTBR_2}$.

(b)  The resulting phase noise is of greatest interest in the adjacent channel
region. Using the appropriate assumption, simplify the $\theta_o(s)$ expression
derived in (a).

(c)  Calculate $S_{\theta o\_R2}$ from the expression in (b) in dBc/Hz at an offset frequency
$\omega_i$ of 25 kHz, with R2 = 100 kΩ, C3 = 0.0047 µF, Kvco = 5 MHz/V.

8.8  Suppose we have a new standard that is identical to DECT except that $f_0$ is
reduced by half. Hence the specs given in Table 8.1 have their center frequency
reduced by a half to 950 MHz, while everything else remains the same. To
design the frequency synthesizer for this new standard, we assume that
sub-section 8.4.2 is already carried out and a new VCO with this new $f_0$ is designed.
We further assume that this new VCO has been designed such that N, $I_{bias}$,
$(W/L)_1$, and $V_{GS1}-V_t$ remain the same as in sub-section 8.4.2. Our task
is to design a new loop filter. To design the new loop filter, let us assume that it
has the same filter configuration as used in sub-section 8.4.3, but with possibly
different poles/zeroes locations and hence a different $f_u$. Find the $f_u$ for this new
loop filter and hence the new poles/zeroes locations.

Fig. P8.2
8.9  We would like to use the Colpitts oscillator described in Problem 7.5 of
Chapter 7 to replace the ring oscillator used in the design example in Section
8.4 and redesign the frequency synthesizer. This oscillator is redrawn in
Figure P8.2. Use a reasonable assumption for the bipolar technology that is
used to implement $Q_1$.

Since this oscillator has different phase noise behaviour, the loop filter has to
be redesigned. We assume that everything else in the synthesizer remains the
same.

Start with the $S_{\theta\_vco}(f_i)$ derived for this oscillator in Problem 7.7, Chapter 7.
Follow the design example (sub-section 8.4.3), and recalculate $f_c$, $f_u$.

8.10  Design the loop filter for a frequency synthesizer. The synthesizer should be
able to produce a set of frequencies in the range from 1 to 2 MHz with a
channel spacing of 10 kHz (i.e., frequencies of 1000, 1010, 1020, ..., 2000
kHz will be generated). We will use a PFD as the phase detector, whose Kpd is
given as 1 mA/rad. Because this detector offers infinite lock range for any type
of loop filter, we use the simplest of these, the passive lag filter, as shown in
Figure P8.3. We assume that a programmable divider is used whose ratio
varies from M = 100 to 200. For subsequent calculations assume that the
synthesizer is optimized for the divider ratio $M_{mean} = \sqrt{M_{min} M_{max}} = 141$. The
VCO has a gain Kvco = 2.24 × 10^6 rad/s/V.

(a)  State F(s), the loop filter transfer function, in terms of R1, R2, C. Plot the
Bode plot of F(s). Specify the low-frequency gain and high-frequency
gain in terms of R1, R2.


Fig. P8.3



(b)  Set fz, the zero frequency of the loop filter F(s), to be 3 times the unity gain
frequency fu of the loop transfer function G(s). Assume that R2 = 240 Ω,
C = 0.33 μF. Calculate R1.

(c)  Calculate the phase margin of the synthesizer. Is the design stable?

(d)  Assume that d (the duty cycle) of ispur coming from the PFD is 0.001.
Calculate the spur in dBc at the output of the synthesizer.

(e)  Upon changing the divider ratio, this synthesizer has to lock (settle) to a
final ferror within a specific time, called the lock time TL. Assume
fstep = fref = 10 kHz and an allowable ferror = 1 Hz. Calculate TL.


References 

1.  T. C. Weigandt, B. Kim, and P. R. Gray, "Analysis of Timing Jitter in CMOS Ring Oscillators,"
in Proc. ISCAS, June 1994, pp. 4.31-4.35.

2.  T. C. Weigandt, "Low Phase Noise, Low Timing Jitter Design Techniques for Delay Cell
Based VCOs and Frequency Synthesizers," thesis, University of California, Berkeley, 1998.

3.  B. Razavi, "A Study of Phase Noise in CMOS Oscillators," IEEE JSSC, Vol. 31, p. 334, March
1996.

4.  F.  Gardner,  "Charge  Pump  Phase  Lock  Loops,"  IEEE  Transactions  on  Communications,  Vol. 
28,  No.  11,  pp.  1849-1858,  November  1980. 

5.  F.  Gardner,  Phaselock  Techniques,  2nd  ed.,  Wiley,  1979. 

6.  P. Gray and R. Meyer, Analysis and Design of Analog Integrated Circuits, 3rd ed., Wiley,
1993.

7.  R. Best, Phase Locked Loops, Theory, Design and Applications, 2nd ed., McGraw-Hill, 1984.

8.  W.  Egan,  Frequency  Synthesis  by  Phase  Lock,  Wiley,  1981,  p.  40. 

9.  W.  Egan,  Frequency  Synthesis  by  Phase  Lock,  Wiley,  1981,  p.  95. 

10.  James  A.  Crawford,  Frequency  Synthesizer  Design  Handbook,  Artech  House,  1994,  p.  122. 

11.  P.  Gray,  "An  Integrated  GSM/DECT  Receiver,  Design  Specification,"  Design  Notes,  University 
of  California,  Berkeley,  1998. 

12.  F.  Martin,  "Frequency  Synthesizers  in  RF  Wireless  Communications,"  Course  Notes,  1994, 
Motorola,  Plantation,  FL. 

13.  T.  Lee,  Design  of  CMOS  RF  Circuits,  p.  497,  Cambridge  University  Press,  1998. 

14.  Z. Shu, K. Lee, B. Leung, "A 2.4-GHz Ring Oscillator Based CMOS Frequency Synthesizer
with a Fractional Divider Dual PLL Architecture," IEEE JSSC, vol. 39, pp. 452-462,
March 2004.


Chapter  9 

Transmitter  Architectures 

and  Power  Amplifier 


9.1     Introduction 

In  Chapter  1  we  focused  on  the  blocks  enclosed  by  dotted  lines  in  Figure  1.1.  They 
are  the  modulator,  demodulator,  and  channel  part  of  the  communication  system. 
We  now  redraw  Figure  1.1  as  Figure  9.1,  with  the  transmitter  back  end  enclosed  by 
dotted  lines. 

We first review some general philosophy on deciding the back end architecture.
Since the transmit architecture is important in GSM but less so in DECT, we use
GSM as the example in this chapter. We will refer to both GSM and
DCS 1800, which is a more modern, up-banded version of GSM.

We then adopt a specific transmit architecture and elaborate on its back end
sub-blocks. Examples of these sub-blocks include mixers, frequency synthesizers,
digital-to-analog converters (DAC), the I/Q (or RF quadrature) generator and power
amplifiers (PA).


9.2     Transmitter  Back  End:  General  Discussion 
9.2.1     Motivations  and  General  Design  Philosophy 

As in the chapter on the receiver, where the front end allows us to satisfy the boundary
conditions imposed by the received signal and the SNR required by the demodulator,
here in the transmitter the back end allows us to take the baseband signal and
condition it to the power level and carrier frequency required by a given standard,
in this case GSM/DCS 1800. Conditioning also includes proper filtering, so that the
transmitted signal does not spill over and cause interference, such as adjacent
channel interference.

There is a large variety of transmitter architectures, each tailored towards a
particular standard. One important feature of the standard is the cell size. Since
cellular standards (e.g. GSM) involve larger cell sizes, the performance
requirement is higher than for wireless LAN standards (e.g. 802.11). On the other hand,
because our underlying technology is CMOS, with low Q and low breakdown
voltage, there are limitations on the type of circuits that can be employed. For
example, some power amplifier (PA) designs that are well suited to a technology with
higher breakdown voltage, such as GaN, are not suitable here. This limitation also
impacts the resulting transmitter architecture.

Fig. 9.1  Back end of the transmitter enclosed in dotted line

Transmitters broadly fall into two classes, depending on how the complex signal,
with its I and Q components, is represented mathematically and transmitted. In I/Q
modulator-based transmitters the I and Q signals are represented in Cartesian form,
i.e. as rectangular components of the complex signal. Each component is
converted by a DAC, and the resulting analog signals are upconverted, combined,
amplified by the PA and transmitted. In polar-based transmitters, the I and Q
signals are represented in polar form, i.e. as the amplitude and phase of the complex
signal. The phase can be encoded, for example, in a PLL, and the amplitude encoded
by varying the RF signal gain of the PA, such as through a varying power
supply or bias. Polar-based transmitters have shown promise with respect to the
elimination of discrete components such as the RF filter. However, there is the
question of the limited data rate imposed by the PLL loop bandwidth and of
matching between the phase and amplitude paths [1], as well as of efficiency and output
power control when applying the full modulation before the signal reaches the PA
[4]. In particular, it may not be straightforward to implement the required power
control by varying the PA gain. For the rest of the chapter we will concentrate on the
I/Q-based architecture.
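The equivalence of the Cartesian and polar representations can be checked numerically. The following is a minimal sketch using only the Python standard library; the function names are hypothetical and chosen for illustration, and the sample values are arbitrary.

```python
import cmath
import math

def cartesian_to_polar(i, q):
    """Return (amplitude, phase in radians) of the complex sample i + jq."""
    return cmath.polar(complex(i, q))

def polar_to_cartesian(amplitude, phase):
    """Return (i, q) for a sample given by its amplitude and phase."""
    z = cmath.rect(amplitude, phase)
    return z.real, z.imag

# Arbitrary example sample with equal I and Q components.
amp, ph = cartesian_to_polar(1.0, 1.0)       # amplitude sqrt(2), phase 45 degrees
i_back, q_back = polar_to_cartesian(amp, ph)  # recovers the original I and Q
```

An I/Q transmitter would send the pair (i, q) through two DACs, while a polar transmitter would route `ph` to the phase path and `amp` to the amplitude path.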


9.2.2    Direct  Conversion  and  Other  Architectures 

We start our discussion with the direct conversion [2] (sometimes called
homodyne) architecture shown in Figure 9.2, where we denote all the circuitry
enclosed by the dotted line as the back end. The back end is
attractive because of the simplicity of its signal path. Note that this architecture is like the
reverse of that shown in Problem 2.9, with both the I and Q paths explicitly shown.
The modulator takes the source data and generates the I and Q signals at baseband. They
typically pass through a D/A converter, an AGC and a baseband filter. They are then
upconverted to RF in a single step by two separate mixers, which have local oscillator
signals LOI and LOQ. The I and Q signals are combined into a single sideband.
The output is then passed through a pre-power amplifier (pre-PA) RF filter. The filter
output is amplified by the PA and then filtered by another RF filter before being sent to
the antenna.
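The combination of the two mixer outputs into a single sideband can be illustrated with a single baseband tone. The sketch below assumes, for illustration only, that LOI is a cosine, LOQ is a sine, and the Q product is subtracted from the I product; the carrier and tone frequencies are arbitrary, and the actual sign convention depends on the circuit in Figure 9.2.

```python
import math

F_C = 900e6   # assumed carrier (LO) frequency in Hz, illustrative only
F_M = 100e3   # assumed baseband tone frequency in Hz, illustrative only

def upconvert(i_bb, q_bb, t):
    """Mix I with LOI = cos, Q with LOQ = sin, and subtract the products."""
    lo_i = math.cos(2 * math.pi * F_C * t)
    lo_q = math.sin(2 * math.pi * F_C * t)   # 90 degrees away from lo_i
    return i_bb * lo_i - q_bb * lo_q

def max_deviation_from_upper_sideband(n_points=1000, t_step=1e-10):
    """Largest difference between the RF output and a tone at F_C + F_M."""
    worst = 0.0
    for n in range(n_points):
        t = n * t_step
        i_bb = math.cos(2 * math.pi * F_M * t)
        q_bb = math.sin(2 * math.pi * F_M * t)
        rf = upconvert(i_bb, q_bb, t)
        upper = math.cos(2 * math.pi * (F_C + F_M) * t)
        worst = max(worst, abs(rf - upper))
    return worst

deviation = max_deviation_from_upper_sideband()
```

The deviation is at the level of floating-point rounding, which reflects the identity cos(a)cos(b) − sin(a)sin(b) = cos(a + b): only the upper sideband at F_C + F_M appears, and the lower sideband is cancelled.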

Some comments on the various filters are as follows. The baseband filter (e.g.
needed for reconstruction) typically has a rather relaxed requirement, and a single
stage of filtering suffices. The pre-PA RF filter is needed to reduce third-order
intermodulation in the PA as well as transmitted noise. The RF filter after the PA
further reduces transmit noise.

Not shown in the diagram is the frequency synthesizer, which generates the LO
signal. The LO is variable to perform frequency tuning. Also not shown is the
quadrature LO generator, which takes this LO signal and generates the LOI and LOQ
signals. LOI and LOQ are 90° out of phase.

In addition to the direct conversion architecture, there are many other
possible I/Q-based transmitter architectures. For example, like the receiver, the
transmitter can use the traditional heterodyne or indirect upconversion
architecture, shown in Figure 9.3. In contrast to the direct conversion architecture, the
frequency conversion is done in two steps. First the baseband signal is frequency
translated to a fixed IF by a first set of mixers, with a fixed LO, LO2. This is then
passed through an IF filter to remove spurious IF harmonics created by this mixing.
The output is then mixed by a second set of mixers with another LO, LO1. LO2 is
typically fixed while LO1 is typically variable, thus providing frequency tuning.
The rest is like the direct conversion architecture. This architecture is more
complicated: there is more filtering, and there is more mixing, which means more
mixers and extra frequency synthesizers to generate the additional LO signals.
However, compared to the direct conversion architecture, the I/Q modulation is
performed at a lower frequency, which means LO feedthrough is reduced.
Fig. 9.3  Indirect conversion transmitter architecture

Fig. 9.4  Harmonic rejection transmitter


A modified heterodyne transmitter is reported in [3]. This dual conversion
architecture, called the "sliding IF" superheterodyne transmitter [17], has LO1
harmonically related to LO2, with both LO1 and LO2 high in frequency (that is, a
high-IF architecture). Therefore there is no need for an additional PLL to generate LO2.
Instead it is derived from LO1 by a divider with a low division ratio.

Another modified heterodyne transmitter, called the harmonic-rejection
transmitter (HRT), is reported in [5] and is shown in Figure 9.4.

It uses a similar principle to the indirect conversion architecture, but it replaces
the first mixer (with local oscillator LO2) by a harmonic rejection mixer (HRM).
The HRM has been used in both transmitters and receivers [1]. To explain its operation, let
us consider the I path only (and hence LO2I). From the square wave LO2I, a set of
signals is derived, where each signal has the same frequency as LO2I but the signals are
separated uniformly in phase. To perform mixing, the input signal (I path) is split
into multiple paths. In each path the input signal is mixed with one of these
LO2I-derived signals by a basic mixer, which acts simply as a multiplier. The same is
done for the input signal in the Q path. Note that the HRM in Figure 9.4 splits the signal
into 2 paths. With both I and Q paths, that means there is a total of 4 such basic mixers.
This is to be contrasted with the indirect conversion transmitter shown in Figure 9.3,
where the input signal in each of the I and Q paths is mixed by only one such basic mixer,
with LO2I and LO2Q respectively, meaning a total of only 2 such basic mixers.
Returning to the HRM in Figure 9.4, the outputs from all these paths are then combined
to form the output of the HRM. Mathematically this results in cancellation of
selected harmonics of LO2I and LO2Q.

For example (again considering only the I path and hence LO2I), let us assume this
time that 3 derived waveforms are generated. The first one is LO2I itself. The second
one is like the first, but with its phase advanced by 45° and its amplitude 0.707 times
that of the first; it is denoted LO2I+45. The third one is like the first, but with its
phase delayed by 45° and its amplitude also 0.707 times that of the first; it is denoted
LO2I−45. Since a 45° shift at the fundamental becomes a 135° shift at the third
harmonic and a 225° shift at the fifth harmonic, mixing (multiplication) in the
HRM generates these shifts at the third and fifth harmonics. The output, which is
the sum, will then see cancellation of the third and fifth harmonics, and harmonic
rejection is achieved. The same applies to the Q path. When the HRM is used in the
transmitter, this means the baseband signal will not be upconverted to around these
harmonics. Rather, it will be upconverted to just around the fundamental. Thus the PA,
which is generally nonlinear, will not generate intermodulation products between the
upconverted baseband signals at the fundamental and at the harmonics. Accordingly, the
IF filter and pre-PA RF filter shown in the indirect conversion transmitter in Figure 9.3
are no longer needed, as is evident from their absence in Figure 9.4. This, of course, is
because the original reason for these filters is to filter out the baseband signal
upconverted to around the harmonics of the LO. A more detailed explanation of this
aspect will be given later in 9.4.1.3, after the introduction of PA nonlinearity, and also
in 9.4.5, which shows the trade-off between the PA nonlinearity and the suppression
provided by these filters, thus highlighting one of the advantages of the harmonic
rejection transmitter.
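The cancellation of the third and fifth harmonics can be verified from the Fourier series of the summed waveforms. The sketch below assumes ideal 50% duty-cycle square waves, whose odd harmonic k has amplitude 4/(πk), and uses the exact weight 1/√2 in place of the rounded 0.707; the function and variable names are hypothetical.

```python
import math

def hrm_harmonic_gain(k, paths):
    """Amplitude of LO harmonic k after summing weighted, phase-shifted
    square waves. Shifting a waveform by phi shifts its harmonic k by k*phi."""
    if k % 2 == 0:
        return 0.0  # an ideal 50% duty-cycle square wave has no even harmonics
    total = 0j
    for weight, phase_deg in paths:
        angle = k * math.radians(phase_deg)
        total += weight * complex(math.cos(angle), math.sin(angle))
    return abs(total) * 4.0 / (math.pi * k)

# (weight, phase in degrees): the unshifted waveform plus the +45 and -45 degree ones
PATHS = [(1.0, 0.0), (1.0 / math.sqrt(2), 45.0), (1.0 / math.sqrt(2), -45.0)]

gains = {k: hrm_harmonic_gain(k, PATHS) for k in (1, 3, 5, 7)}
```

Under these assumptions the fundamental adds constructively while the third and fifth harmonics sum to zero to within rounding. The seventh harmonic is not cancelled by this three-waveform combination.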

Another difference from the indirect conversion architecture is the tuning aspect.
As mentioned above, for the indirect conversion architecture, LO2 is typically fixed
while LO1 is typically varying, thus providing tuning. In Figure 9.4, on the other
hand, the opposite is true: LO2 varies and provides tuning while LO1 is fixed.
In general the phase noise of a frequency synthesizer is usually higher when the
LO it generates is higher in frequency, as well as when it is variable. Thus LO1 in
Figure 9.3 has a high phase noise requirement, while both LO1 and LO2 in Figure 9.4
have moderate phase noise requirements and on-chip implementation becomes possible.
Because of the high phase noise in LO1 in Figure 9.3, the transmitted noise also
increases, and both the pre-PA RF filter and the RF filter are adopted to suppress it.
With a moderate phase noise in LO1 in Figure 9.4, and hence lower transmit noise,
only the RF filter is used to suppress the transmit noise.

Finally there is also the frequency translation architecture [11], shown in Figure
9.5. Like the indirect architecture, the baseband signal is frequency translated to a
fixed IF, at f1 − f2 in this architecture. This is then filtered by a BPF to remove
unwanted IF harmonics. The IF signal is then upconverted to RF, at f2. Unlike the
indirect architecture, where the IF signal is upconverted to RF by a mixer, here the
upconversion is done by a PLL, which internally has a mixer in the feedback path.
This feedback mixer takes an input from a fixed oscillator, at frequency f1. It
mixes the PLL output with this f1 signal and feeds the resulting mixed output back.
Ignoring for the time being the feedback LPF, the phase detector compares this mixed
output with the f1 − f2 input signal of the PLL. When the PLL is in lock, the
mixed output is forced to have frequency f1 − f2 as well. Working backwards through
the feedback mixer, since its output has frequency f1 − f2, and we already know that one
of its inputs has frequency f1, its "other" input must have frequency f2. Therefore
the PLL output, which is the feedback mixer's "other" input, has frequency f2.

We now return to the function of the feedback LPF, which we have so far ignored.
The output of the PLL, as just shown, has frequency f2. When it is mixed with f1 by
the feedback mixer, the mixer output has frequencies f1 − f2 and f1 + f2. The
feedback LPF is designed to filter out the f1 + f2 component. Thus the feedback
signal to the PD is indeed at f1 − f2, consistent with the discussion above. In
essence the PLL acts as a "mixer": it mixes the two input signals, one at f1 − f2 and
one at f1, and outputs the desired signal at f2. The rest follows the indirect
architecture.
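The frequency bookkeeping of the translation loop can be sketched as follows. The numerical values of f1 and f2 are hypothetical and chosen only for illustration, with f1 > f2 assumed so that the IF at f1 − f2 is positive; the function names are not from the text.

```python
def feedback_mixer_products(f_out, f1):
    """Difference and sum frequencies produced by the feedback mixer."""
    return abs(f1 - f_out), f1 + f_out

def locked_output_frequency(f_if, f1):
    """In lock, the low-pass filtered mixer output (f1 - f_out) must equal
    the IF presented to the phase detector, so f_out = f1 - f_if."""
    return f1 - f_if

F1 = 1.1e9   # hypothetical fixed oscillator frequency, Hz
F2 = 0.9e9   # hypothetical desired RF output frequency, Hz

f_if = F1 - F2                                   # IF applied to the PLL input
f_out = locked_output_frequency(f_if, F1)        # PLL output in lock
f_diff, f_sum = feedback_mixer_products(f_out, F1)
```

With these values the loop output lands at f2, the difference product equals the IF, and the sum product at f1 + f2 is the component the feedback LPF has to remove.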

The above discussion brings us to the end of our survey of transmitter
architectures. Next we discuss the implementation of the transmitter.
For the rest of the chapter we will concentrate on the direct conversion
architecture to help illustrate such circuit implementation.

Referring back to Figure 9.2, in the direct conversion architecture the filter design is
similar to that in the receiver chapter, although the requirements are different. This
will be discussed further later, for example in section 9.4.5. One way to design the
D/A converter is to generalize the 1-bit internal D/A converter
(used inside a sigma-delta A/D converter in Chapter 6) to a multi-bit general
purpose D/A converter. Again the choice of D/A converter can be rather general.
Whereas in Chapter 6 the D/A converter is of the switched-capacitor type, the
DAC used in a transmitter can also be resistor-string based. As for the mixer design,
the techniques presented in Chapters 4 and 5 can be used. For LO generation, the
frequency synthesizer techniques presented in Chapters 7 and 8 can be used to generate
the primary LO signal. The quadrature LO signals, LOI and LOQ, are then generated
from it. The new blocks in Figure 9.2 are then

a)  Quadrature  LO  generator 

b)  Power  amplifier 


The  rest  of  this  chapter  concerns  their  design. 
9.3     Quadrature LO Generator

For the direct conversion architecture, single sideband modulation is used and a
quadrature LO is needed. Not only do we want LOI and LOQ to differ by 90°, it is also
desirable that both signals (typically square waves) have a 50% duty cycle. Some
design issues for these circuits include oscillation frequency, amplitude imbalance
and phase imbalance.


9.3.1     Single  Ended  RC 

A  single-ended  RC  quadrature  generator  is  shown  in  Figure  9.6a.  It  consists  of 
feeding  the  LO  to  two  sections:  an  RC  section  and  a  CR  section. 
When LO is applied to R1, C1, the circuit's response is:

$$\phi(f) = \frac{j 2\pi f C_1 R_1}{1 + j 2\pi f C_1 R_1} \tag{9.1}$$

R1, C1 are chosen to give a 45° phase shift between LO and LOI at the frequency
of operation (the carrier frequency), as shown in Figure 9.6b.
LO is also applied to C2, R2. The circuit's response is:

$$\theta(f) = \frac{1}{1 + j 2\pi f C_2 R_2} \tag{9.2}$$

Fig. 9.6  Single-ended RC quadrature LO generator: (a) circuit, (b) relative phase of 90°




R2, C2 are chosen so that there is a −45° phase shift between LO and LOQ at the
frequency of operation, as shown in Figure 9.6b (if R1 = R2 and C1 = C2, the −45° phase
shift at the carrier frequency follows from symmetry). Together there is a phase
shift of 90° between LOI and LOQ. In practice, process variation in R and C causes
phase and amplitude errors. Accordingly this circuit is not suitable when there is a
tight quadrature requirement on phase and matching requirement on amplitude.
In addition, the input impedance of this generator is determined by R and C. For high
input impedance (to avoid loading the LO circuit), large R and small C are desirable.
However, at RF, the parasitic capacitance associated with R starts to become
significant in loading the LO circuit. Furthermore, as C becomes smaller, mismatch
becomes more pronounced. As usual, drivers (typically common-source amplifiers
or source followers) can be used. Finally, instead of connecting LOI and LOQ directly to
the mixers, buffers can be used for interfacing. This is especially important if the
mixers are passive mixers, which require rail-to-rail LOI and LOQ swing. With a
passive RC/CR network, the inherent loss is made up for by the buffers to
provide such swing. However, these buffer stages usually consume a fair bit of current,
and the extra phase noise they introduce can be significant.

Numerical  Example  9.1 

Design a single-ended RC quadrature LO generator that gives a 90° phase shift at 1 GHz.
Let us select the following values: C = C1 = C2 = 47 pF, R = 3 ohm. Substituting in
(9.1), (9.2), there is a phase shift of 90° between LOI and LOQ at 1 GHz. To show this,
let us look at the resulting phase plot in Fig. 9.7:


Fig. 9.7  Phase plots of arg(φ(f)) and arg(θ(f)) versus frequency
Here θ, φ are the plots of the RC and CR sections, respectively. Note from the phase
plots that their phase difference at 1 GHz is 90°, as desired.

The resulting amplitude plot is shown in Fig. 9.8.


Fig. 9.8  Amplitude plots


From the amplitude plot, at 1 GHz the RC and CR sections have identical amplitude,
as desired. This amplitude, when referenced to dc, represents a loss of 3 dB, as
expected.
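The numbers in this example can be reproduced with a few lines of code. The sketch below assumes the standard first-order low-pass and high-pass responses, 1/(1 + j2πfRC) and j2πfRC/(1 + j2πfRC), for the RC and CR sections; the function names are hypothetical.

```python
import cmath
import math

def rc_lowpass(f, r, c):
    """First-order RC (low-pass) response at frequency f."""
    return 1 / (1 + 1j * 2 * math.pi * f * r * c)

def cr_highpass(f, r, c):
    """First-order CR (high-pass) response at frequency f."""
    x = 1j * 2 * math.pi * f * r * c
    return x / (1 + x)

F = 1e9       # frequency of operation, Hz
R = 3.0       # ohm, value quoted in the example
C = 47e-12    # farad, value quoted in the example

theta = rc_lowpass(F, R, C)
phi = cr_highpass(F, R, C)

phase_diff_deg = math.degrees(cmath.phase(phi) - cmath.phase(theta))
rc_db = 20 * math.log10(abs(theta))
cr_db = 20 * math.log10(abs(phi))
```

For matched R and C the phase difference between the two sections is 90° at every frequency. With the rounded component values quoted, 2πfRC is about 0.89 rather than exactly 1, so the computed magnitudes come out near −2.5 dB and −3.6 dB; an RC product of exactly 1/(2πf) would give −3 dB for both sections.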


9.3.2     Single  Ended  LC 

A  single-ended  LC  circuit  is  shown  in  Figure  9.9. 

In this circuit, LOI is taken directly from LO. The circuit response is:

$$\frac{V_{LO_Q}(\omega)}{V_{LO_I}(\omega)} = \frac{\omega^2 C_1 L R_2}{\omega^2 L R_2 (2C_2) - R_2 - j\omega L} \tag{9.3}$$

At resonance, the phase shift between LOQ and LOI is 90°, as shown in
Figure 9.10. Thus we pick the component values so that the circuit resonates at the
carrier frequency.


Fig. 9.9  Single-ended LC quadrature LO generator

Fig. 9.10  Relative phase of 90°

Once again process variation causes phase and amplitude errors. The resonant
frequency (and thus the frequency at which we want a 90° phase shift) can be tuned
by varying, for example, C2. This can be achieved by implementing C2 as an
on-chip varactor. The issue of loading, discussed for the RC quadrature generator, remains.

Numerical  Example  9.2 

Design a single-ended LC quadrature LO generator that gives a 90° phase shift at 1 GHz.

Let us select the following values: C1 = 0.5 pF, C2 = 2.7 pF, R2 = 10 ohm,
L = 4.7 nH. Then from (9.3), there is a phase shift of 90° between LOI and LOQ at
1 GHz. To show this, let us look at the resulting phase plot.

Note from the phase plot in Fig. 9.11 that the phase at 1 GHz is 90°.

Fig. 9.11  Phase plot

The resulting amplitude plot is shown in Fig. 9.12.

Fig. 9.12  Amplitude plot

From the amplitude plot in Fig. 9.12, at 1 GHz there is resonance, as expected.
At resonance the phase shift across C1 is 90°. The QL chosen was 3.3, which is
typical of on-chip spiral inductor Qs.
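The phase at 1 GHz can be checked by evaluating the response numerically. The sketch below uses the form of (9.3) as read from the printed equation, with the denominator term ω²·L·R2·(2C2) − R2 − jωL; that reading, and the function name, are assumptions made for illustration.

```python
import cmath
import math

def lc_quadrature_ratio(f, c1, c2, r2, l):
    """V_LOQ / V_LOI, using (9.3) as read from the text (an assumption)."""
    w = 2 * math.pi * f
    numerator = w**2 * c1 * l * r2
    denominator = w**2 * l * r2 * (2 * c2) - r2 - 1j * w * l
    return numerator / denominator

# Component values quoted in Numerical Example 9.2
C1, C2, R2, L = 0.5e-12, 2.7e-12, 10.0, 4.7e-9

ratio = lc_quadrature_ratio(1e9, C1, C2, R2, L)
phase_deg = math.degrees(cmath.phase(ratio))

# Frequency at which the real part of the denominator vanishes
f_res = 1 / (2 * math.pi * math.sqrt(L * 2 * C2))
```

With these values the real part of the denominator nearly vanishes at 1 GHz, so the ratio is almost purely imaginary and the phase is within a small fraction of a degree of 90°. The resonant frequency computed this way is within about 1 MHz of 1 GHz.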


9.3.3    R-C  with  Differential  Stages 


A  differential  implementation  of  the  single-ended  RC  quadrature  LO  generator 
shown  above  can  also  be  realized.  This  is  shown  in  Figure  9.13. 


Set LO as LO = A_LO sin(ω_LO t); then

$$LO_I = \frac{2R_2}{R_1}\, A_{LO}\sin(\omega_{LO} t) \tag{9.4}$$

$$LO_Q = 2R_2C_1\, \frac{d}{dt}\bigl(A_{LO}\sin(\omega_{LO} t)\bigr) = \omega_{LO}\, 2R_2C_1\, A_{LO}\cos(\omega_{LO} t) \tag{9.5}$$

Since LOI and LOQ depend on sin(ω_LO t) and cos(ω_LO t), respectively, they are
90° out of phase.
Fig. 9.13  Differential RC quadrature LO generator


9.3.4    Polyphase I/Q Generator

Another extension of the single-ended RC quadrature LO generator shown
above can also be realized. This is achieved by having each section realize not
90°, but a 90°/N phase shift, where N is the number of RC sections (N = 1 in the
single-ended RC quadrature LO generator). This is called a polyphase quadrature
generator [8, 11], since internally multiple phases, corresponding to 90°/N, are
generated. The advantage of the polyphase generator over the single-ended RC
quadrature LO generator is that the phase and amplitude mismatch due to R, C process
variation is reduced significantly.

The case for N = 2, with the LO designed for 1 GHz, is shown in Figure 9.14. The
circuit in Figure 9.14 has 4 inputs. Two of them come from the LO (plus and minus if
the LO has a differential output, plus and ground if the LO has a single-ended output).
The other two inputs are connected to ground. This is shown in Figure 9.15. Its phase
response is shown in Figure 9.16.

There are 2 traces here, corresponding to the phases of LOI and LOQ. Note that at
1 GHz the phase difference is 90°. The RC values of the first stage are designed for a
900 MHz center frequency, and the RC values of the second stage for a 1100 MHz center
frequency. By skewing the values, it is possible to achieve a broader band response
around 1 GHz. Because the generator contains multiple RC/CR sections, the impedance
of successive sections can increase progressively. This can alleviate the loading problem.

In  summary,  the  quadrature  generators  in  this  section  provide  phase  splitting 
without  any  frequency  division,  and  they  are  passive  circuits.  These  imply  high 
linearity,  low  noise,  and  low  power  consumption,  which  make  them  ideal  for  many 
designs. 


The polyphase concept is also used in receivers [7], for image rejection.

Fig. 9.14  Polyphase quadrature LO generator

Fig. 9.16  Phase plots for polyphase I/Q generator

Fig. 9.17  Digital quadrature LO generator

Fig. 9.18  Waveforms of digital quadrature LO generator

9.3.5    Divider  Based  Generator 


The  digital  based  quadrature  LO  generator  [9]  is  shown  in  Figure  9.17a. 

Here the input signal is the LO. It is first fed into a T flip-flop (FF), called T-FF1.
This is realized by a D-FF with Q1-bar (the inverted output) connected back to its input.
This realizes a divide-by-2 function, as shown in the second trace in Figure 9.18, and
provides the in-phase output Q1.

Meanwhile LO is fed into the CK-bar (rather than CK) input of the second T-FF, T-FF2,
and edge triggers it. In essence this is equivalent to inverting LO, which gives it a 180°
phase shift, and using the inverted LO to edge trigger T-FF2. Again the output, Q2, is
divided by 2. The phase inversion, meanwhile, is also divided by 2, so that compared to
Q1, Q2 has a phase shift of 90°, as shown in the third trace in Figure 9.18. Thus a
quadrature LO is generated.

Figure 9.17b further generalizes this by feeding the output Q2 (rather than Q1-bar)
back to D1. In addition, this time LO is fed into the CK (rather than CK-bar) input of
the second T-FF. This circuit also realizes a quadrature LO, but it is divided by 4
rather than 2. The waveforms are shown in the last 2 traces in Figure 9.18.
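The divide-by-2 case can be illustrated with a simple behavioural model. The sketch below is an idealized simulation, not a circuit-level description: time advances in half-periods of LO, one toggle flip-flop changes state on each rising edge and the other on each falling edge, and delays and skew are ignored. The function name is hypothetical.

```python
def divide_by_two_quadrature(n_lo_cycles):
    """Return (lo, q1, q2) traces sampled once per LO half-period.

    q1 toggles on rising edges of LO, q2 on falling edges, so both run at
    half the LO frequency and q2 lags q1 by one LO half-period."""
    q1, q2 = 0, 0
    lo_trace, q1_trace, q2_trace = [], [], []
    for half in range(2 * n_lo_cycles):
        lo = 1 if half % 2 == 0 else 0
        if lo == 1:
            q1 ^= 1   # rising edge of LO toggles the first flip-flop
        else:
            q2 ^= 1   # falling edge of LO toggles the second flip-flop
        lo_trace.append(lo)
        q1_trace.append(q1)
        q2_trace.append(q2)
    return lo_trace, q1_trace, q2_trace

lo, q1, q2 = divide_by_two_quadrature(4)
```

Each output repeats every four half-periods, i.e. every two LO cycles, and `q2` is `q1` delayed by one half-period of LO. That delay is one quarter of the output period, which corresponds to the 90° shift described above.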

The D-FF used in these circuits is typically current-mode based. The bandwidth
of operation depends, among other things, on the load impedance. Typically
broadband operation is desired, since the delay is then maintained at all harmonics
of the LO signal and phase mismatch is minimized. For broadband operation, the load
can be resistive (while minimizing parasitic capacitance) or inductive.

One advantage of the digital-based generator is that the 50% duty cycle feature is
inherent. In addition, amplitude and phase imbalance are typically small. As for
phase noise, the divide-by-2 and divide-by-4 configurations give a 6 dB and a 12 dB
improvement, respectively. In the divide-by-2 circuit, since the 2 FFs are triggered
by two different clock edges, clock skew can be a problem and lead to mismatch.
This is not an issue in the divide-by-4 circuit, as the triggering is done by the same
clock edge. Also, since LOI and LOQ are at a different (lower) frequency than LO,
the impact of feedthrough and unwanted coupling is not as acute, although the
LO has to be generated at a higher frequency.
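The 6 dB and 12 dB figures follow from the usual rule that an ideal divide-by-N stage reduces phase noise by 20·log10(N) dB. The sketch below applies that rule; it assumes a noiseless divider, and the function name is hypothetical.

```python
import math

def divider_phase_noise_improvement_db(n):
    """Phase noise improvement of an ideal divide-by-n stage, in dB."""
    return 20 * math.log10(n)

improvement_div2 = divider_phase_noise_improvement_db(2)   # about 6 dB
improvement_div4 = divider_phase_noise_improvement_db(4)   # about 12 dB
```

In a real divider the flip-flops add noise of their own, so the improvement actually obtained can be smaller than these ideal values.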


9.4     Power  Amplifier  Design 
9.4.1     Specs 

9.4.1.1     General  Discussion 

Like the LNA in Chapter 3, the power amplifier (PA) has specs for its gain G, NF and IIP3.

To demonstrate how these are obtained, we use GSM as an example. Figure 9.19 is the power output specification [11] for GSM (880 MHz-915 MHz), together with the transmit spurious specification [14]. We further assume the PA is used in a homodyne architecture, as shown in Figure 9.20, where the filters and switch are assumed to have the losses indicated.

From Figure 9.19, it can be seen that the power output is 2 W (33 dBm) in the Tx band. The phase accuracy of the modulated carrier is 5° rms and 20° peak.

In addition, spurious emission from the transmitter has to be less than -162 dBc/Hz in the Rx band (935 MHz-960 MHz) so as not to interfere with its own received signal. It also has to be below -104 dBc/Hz in the Rx band of DCS 1800 (1805 MHz-1880 MHz) so as not to interfere with the DCS 1800 receiver.
Fig. 9.19 PA output power level for GSM

9.4.1.2     G and NF Specs

Next, referring to Figure 9.20, we know that the power output level (from Figure 9.19) has to be 33 dBm. Let us further assume that at the output of the I, Q modulator, the signal is 9 dBm.

To  achieve  this,  G  has  to  be: 


G = 30 dB.   (9.6)



To  see  this,  going  along  the  transmitter  chain: 

At the output of the BPF, signal = 9 dBm - 4 dB (from loss) = 5 dBm. Thus the amplitude of the input to the PA, denoted as Ain, is

Ain = 5 dBm.   (9.7)

At the output of the PA, signal level = Ain (= 5 dBm, from (9.7)) + G (= 30 dB, from (9.6)) = 35 dBm.

At the antenna, signal level = 35 dBm - 2 dB (from loss) = 33 dBm.

This  satisfies  the  required  output  level  of  33dBm. 
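The level plan above can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function and variable names are ours, and the losses are the ones assumed in the text for Figure 9.20 (4 dB in the BPF, 2 dB between the PA and the antenna).

```python
# Level plan along the transmit chain of Figure 9.20, using the losses
# assumed in the text (4 dB in the BPF, 2 dB between PA and antenna).


def required_pa_gain_db(p_mod_dbm, bpf_loss_db, post_pa_loss_db, p_ant_dbm):
    """Return (PA input level in dBm, PA gain in dB) that meets p_ant_dbm."""
    pa_in_dbm = p_mod_dbm - bpf_loss_db
    gain_db = p_ant_dbm + post_pa_loss_db - pa_in_dbm
    return pa_in_dbm, gain_db


def dbm_to_watts(p_dbm):
    """Convert a power level in dBm to watts."""
    return 10 ** (p_dbm / 10.0) / 1000.0


a_in_dbm, g_db = required_pa_gain_db(
    p_mod_dbm=9.0, bpf_loss_db=4.0, post_pa_loss_db=2.0, p_ant_dbm=33.0
)
print(f"Ain = {a_in_dbm} dBm, G = {g_db} dB")
print(f"33 dBm corresponds to {dbm_to_watts(33.0):.2f} W")
```

Changing any of the assumed losses shows directly how the gain requirement on the PA moves.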

Meanwhile, transmit noise in the receive band needs to be kept below -129 dBm/Hz from 935 MHz to 960 MHz. This is because, from Figure 9.19, spurious emission from the transmitter has to be less than -162 dBc/Hz in the Rx band (935 MHz-960 MHz). With the signal level, i.e. the carrier, at 33 dBm, -162 dBc/Hz (with respect to the carrier) means -162 dBc/Hz + 33 dBm, or -129 dBm/Hz.

To carry on the noise analysis, let us further assume that at the output of the I, Q modulator, noise at frequencies above 935 MHz (Rx band) is less than -150 dBm/Hz.

To  achieve  this,  NF  needs  to  be: 

NF  =  8dB  (9.8) 

To  see  this,  going  along  the  transmitter  chain: 

At the output of the BPF, noise = -150 dBm/Hz - 15 dB (from the frequency response, which equals -15 dB at 935 MHz) = -165 dBm/Hz.

At the output of the PA, noise = -165 dBm/Hz + G (= 30 dB, from (9.6)) + NF (= 8 dB, from (9.8)) = -127 dBm/Hz.

At the antenna, noise = -127 dBm/Hz - 2 dB (from loss) = -129 dBm/Hz, satisfying the specs.

In summary, in this example the specs are G = 30 dB and NF = 8 dB.
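The noise budget can be traced in the same way. The following sketch simply repeats the bookkeeping used in the text (noise in dBm/Hz, with G and NF added in dB at the PA); the function name and argument names are illustrative assumptions, not part of the GSM specification.

```python
# Receive-band noise along the transmit chain, following the text's
# simplified bookkeeping: subtract filter rejection, add G and NF, subtract loss.


def tx_noise_at_antenna_dbm_hz(n_mod, bpf_rejection_db, gain_db, nf_db,
                               post_pa_loss_db):
    """Noise density (dBm/Hz) at the antenna in the Rx band."""
    n_pa_in = n_mod - bpf_rejection_db       # after the BPF
    n_pa_out = n_pa_in + gain_db + nf_db     # after the PA
    return n_pa_out - post_pa_loss_db        # after the 2 dB loss


carrier_dbm = 33.0
limit_dbm_hz = -162.0 + carrier_dbm          # -162 dBc/Hz referred to carrier
noise_dbm_hz = tx_noise_at_antenna_dbm_hz(
    n_mod=-150.0, bpf_rejection_db=15.0, gain_db=30.0, nf_db=8.0,
    post_pa_loss_db=2.0,
)
meets_spec = noise_dbm_hz <= limit_dbm_hz
print(f"limit = {limit_dbm_hz} dBm/Hz, noise = {noise_dbm_hz} dBm/Hz, "
      f"meets spec: {meets_spec}")
```

With these numbers the budget is met with no margin, so any increase in NF or reduction in BPF rejection would violate the limit.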

Next  let  us  discuss  IIP3. 


9.4.1.3     IIP3  Specs 

We  take  note  that  nonlinearity  of  the  PA,  as  quantified  by  its  IIP3,  is  going  to  affect 
the  GSM  specs  on  total  transmit  error.  Below  we  look  at  some  of  those  mechanisms. 


IIP3  Specs  Due  to  AM-PM  Conversion 

One possible mechanism is that the nonlinearity causes AM-PM conversion [6], and the tolerated amount of the resulting PM is limited by the phase error specs of GSM. Referring to the example architecture with the I/Q modulator shown in Figure 9.20, the I, Q baseband signals are modulated on the carrier and applied to the PA. Shown in Figure 9.21a are visualizations of the I and Q components of the PA input. One symbol period of the modulated carrier waveform is shown. The envelope is practically constant (around the arbitrarily set value of 1) and the modulation is on the phase. The circles highlight the values of the I, Q components at the end of the symbol period. Going through the calculation, this gives a phase of 220 degrees. Next we add AM to the PA input and pass it through the PA. For visualization, we arbitrarily set the PA gain to around 8 and also add an exaggerated level of nonlinearity. In addition the amount of AM is highly exaggerated, so that the envelope at the PA output, in Figure 9.21b, varies a lot. Again using the values highlighted by the circles at the end of the symbol period, the phase is 180 degrees. Thus an error of 40 degrees is made. As we look at consecutive symbols, this phase error changes. To capture this, we specify both the peak phase error and the RMS (root mean square) phase error.

Fig. 9.21 Visualization of AM-PM conversion due to nonlinearity. (a) I and Q components of the input; (b) I, Q components of the output, with AM impressed on the input, which passes through a PA with nonlinearity.

Fig. 9.22 Ratio of wanted to unwanted sideband

Now let us see how this AM to PM conversion mechanism arises in the example architecture shown in Figure 9.20. We first simplify Figure 9.20 so that the BPF, LPF and switch are ideal, with loss/gain of 0 dB. From 9.2.2, if we have only tone modulation (i.e. the I, Q components are a single tone at ω1), we have:

I baseband signal = A_s \cos(\omega_1 t),   Q baseband signal = A_s \sin(\omega_1 t)   (9.9)

Let us represent the LO signals for the I, Q paths (at ω2) as:

LO_i = A_c \cos(\omega_2 t),   LO_q = A_c \sin(\omega_2 t)   (9.10)

The resulting modulator output is:

A_s A_c [\cos(\omega_1 t) \cos(\omega_2 t) + \sin(\omega_1 t) \sin(\omega_2 t)]   (9.11)

This can be written as A_s A_c cos((ω2 - ω1) t). This is single sideband (SSB) modulation, which produces a single tone at ω2-ω1.
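As a quick numerical check of the identity behind (9.11), the sketch below multiplies the I, Q tones by the quadrature LO signals and compares the sum against a single tone at ω2-ω1. The frequencies (45 MHz and 900 MHz) are the ones used later in this section; the unit amplitudes and the sampling grid are arbitrary choices made for illustration.

```python
import math


def modulator_output(t, a_s, a_c, w1, w2):
    """Ideal I/Q modulator: I*LOi + Q*LOq for single-tone I, Q inputs."""
    i_bb = a_s * math.cos(w1 * t)
    q_bb = a_s * math.sin(w1 * t)
    lo_i = a_c * math.cos(w2 * t)
    lo_q = a_c * math.sin(w2 * t)
    return i_bb * lo_i + q_bb * lo_q


def single_tone(t, a_s, a_c, w1, w2):
    """Expected SSB result: one tone at w2 - w1."""
    return a_s * a_c * math.cos((w2 - w1) * t)


w1 = 2 * math.pi * 45e6    # baseband tone (rad/s)
w2 = 2 * math.pi * 900e6   # LO (rad/s)
times = [k * 1e-10 for k in range(2000)]
max_diff = max(
    abs(modulator_output(t, 1.0, 1.0, w1, w2) - single_tone(t, 1.0, 1.0, w1, w2))
    for t in times
)
print(f"largest deviation from a single tone: {max_diff:.2e}")
```

The deviation is at the level of floating-point rounding, which confirms that an ideal modulator produces only the lower sideband.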

In practical implementations, mismatch between the I and Q paths can happen in the transmitter, just as in the receiver [15]. Going back to Figure 9.2, which shows the practical realization of Figure 9.20, there is a DAC for the I path as well as one for the Q path, and there is imbalance between the two due to mismatch in circuit components. Similarly, there is a mixer for the I path as well as one for the Q path, and there is also imbalance. In addition there is imbalance due to the quadrature LO generator described in 9.3, because the generator does not generate LOi and LOq that are exactly 90° out of phase. The consequence of the I, Q mismatch is that the resulting frequency spectrum no longer has a single tone at ω2-ω1, but also has an image. This is shown in Figure 9.22, where, in addition to the desired SSB output at ω2-ω1, there is an undesired and smaller component at ω2+ω1 (see footnote 2).

2 Also shown is a frequency component at ω2, which may come, for example, from LO feedthrough.

This undesired component means the modulator output (i.e. the PA input) contains AM. In general, even though the single tone is actually the message, I, Q mismatch is still going to result in the PA input containing AM. If we approximate the PA nonlinear transfer function (see footnote 3) up to the 3rd-order term as:

v_{out}(t) = \alpha_0 + \alpha_1 v_{in}(t) + \alpha_2 v_{in}^2(t) + \alpha_3 v_{in}^3(t)   (9.12)

then as shown in Figure 9.21, this nonlinearity converts the AM to PM and causes phase error. Specifically the odd-order terms in (9.12) are responsible for AM to PM conversion [11], which is expressed in deg/dB [6]. For simplicity, we may want to represent vin as:

v_{in}(t) = A(t) \sin[(\omega_2 - \omega_1) t + \phi(t)]   (9.13)

Here φ(t) represents the message. A(t) contains the Adesired term, where Adesired is the amplitude of the desired signal at ω2-ω1 (shown in Figure 9.22), or the RF amplitude. A(t) also contains the modulating signal term.

To learn a bit more about what happens when vin goes through the nonlinearity, as represented by (9.12), and generates vout, let us look at the visualization in Fig. 9.23. Instead of writing the nonlinear transfer function as a polynomial, as in (9.12), a compact way to represent it is to use a tanh (hyperbolic tangent) function, so that vout(t) = tanh(a · vin(t)). Here a controls the amount of nonlinearity, with a increasing as the PA becomes more nonlinear. We also use low frequency signals to simplify analysis, so that the message again becomes a tone. Thus the AM modulated vin is represented as A(t) · sin[(ω2 - ω1) · t]. For illustration, we exaggerate the level of modulation, so vin is written as sin(2ω1 · t) · sin((ω2 - ω1) · t), with 2ω1 as the modulating frequency. vout is now given as:

v_{out}(t) = \tanh(a \sin(2\omega_1 t) \sin((\omega_2 - \omega_1) t))

In the visualization, we set a = 2 and ω2-ω1 to about 10 times 2ω1.

Plotting, we have Fig. 9.23, where AM (exaggerated) is evident.

Next we perform an FFT on vout. The frequency spectrum is shown in Figure 9.24, where the fundamental and the 3rd-order intermodulation product are shown. The latter is around 17 dB below the fundamental, i.e. IM3 = -17 dB (from this, IIP3 can be calculated). In general, as a (set equal to 2 here) increases, IM3 increases.
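A rough version of this experiment can be reproduced without any plotting tools. The sketch below builds one period of the AM modulated input, passes it through tanh, and evaluates two DFT bins: a fundamental sideband (carrier plus modulating frequency) and the third-order product (carrier plus three times the modulating frequency). The bin choices, the sample count and the helper names are our assumptions; the text does not state exactly how Figure 9.24 was produced, so the numbers need not match it precisely.

```python
import cmath
import math


def tone_amplitude(samples, k):
    """Amplitude of DFT bin k, scaled so that a unit sine returns 1."""
    n = len(samples)
    acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
              for i, x in enumerate(samples))
    return 2.0 * abs(acc) / n


def im3_db(a, f_mod=1, f_car=10, n=2048):
    """IM3 in dB relative to the fundamental sideband at f_car + f_mod.

    f_mod and f_car are in cycles per record, so both fall on exact bins.
    """
    v_out = [
        math.tanh(a * math.sin(2 * math.pi * f_mod * i / n)
                  * math.sin(2 * math.pi * f_car * i / n))
        for i in range(n)
    ]
    fundamental = tone_amplitude(v_out, f_car + f_mod)
    third_order = tone_amplitude(v_out, f_car + 3 * f_mod)
    return 20.0 * math.log10(third_order / fundamental)


for a in (0.5, 1.0, 2.0):
    print(f"a = {a}: IM3 = {im3_db(a):.1f} dB")
```

Running the loop shows the trend stated above: the intermodulation product grows relative to the fundamental as a increases.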

Having  used  the  above  visualizations  to  clarify  how  nonlinearity  in  a  PA  causes 
AM-PM  conversion  in  the  example  architecture  shown  in  Figure  9.20,  let  us  next 
try  to  estimate  the  amount  of  conversion  and  hence  the  required  IIP3  that  satisfies 
the  specs. 

Given: 

For GSM (which uses GMSK), the phase accuracy of the modulated carrier is 5° rms (20° peak) in Tx phase error.   (9.14)


3 In [18], a more advanced model using Volterra series, as in Chapters 4 and 5, is used to model the PA. This includes capacitive effects, which further move the zero crossing and aggravate the situation.

Fig. 9.23 Visualization of vout when an AM modulated vin is applied to a PA whose nonlinearity is represented by a tanh function: waveform plot (1 symbol period)

Fig. 9.24 Visualization of vout when an AM modulated vin is applied to a PA whose nonlinearity is represented by a tanh function: frequency spectrum (fundamental and first few intermodulation components)


From Figure 9.20, the signal level at the input to the PA is 5 dBm, i.e. Adesired = 5 dBm. The amount of mismatch between the I, Q paths is selected so that the resulting undesired signal (shown in Figure 9.22) gives an AM modulating signal whose amplitude is 1/10 of Adesired. If we further set the example architecture in Figure 9.20 so that ω1 = 45 MHz and ω2 = 900 MHz (i.e. ω2 = 20ω1), this allows us to set up the vin given in (9.13). Representing the PA by (9.12), we then simulate vout. The simulation is first run for a given α1, α3, and the rms phase error is calculated from the simulated vout. We then vary α1, α3 (and hence IIP3) and obtain the corresponding rms phase error. A plot of phase error vs IIP3 is given in Fig. 9.25. As seen, the phase error (RMS) decreases (improves) with increasing IIP3 (more linear), as expected. At around -17 dBm the phase error is around 2.8 degrees and is getting close to the GSM spec of 5 degrees. Therefore the estimated required IIP3 is:

IIP3 = -17 dBm   (9.15)

Fig. 9.25 Phase Error vs IIP3


Note that in reality, because of capacitive effects in the PA, which are not modelled in (9.12), AM-PM conversion is worse (see also footnote 3) and the required IIP3 is higher.


IIP3  Specs  Due  to  Distorted  LO 

The limitation on IIP3 described in 9.4.1.3.1 is present for the architecture shown in Figure 9.20, regardless of the exact circuit implementation. Often the local oscillator does not generate a perfect sine wave and thus it has higher order harmonics. Coupled with the nonlinearity of the PA, this is going to cause further problems [5]. Here the nonlinearity intermodulates unwanted signals and reduces the SNR.


Fig. 9.26 IIP3 due to LO 3rd harmonic mixing


We start with the desired signal, whose frequency is at LO+BB, as shown in Figure 9.26. Here LO is the local oscillator fundamental frequency and BB is the frequency of the baseband signal. Typically the local oscillator does not generate a perfect sine wave and thus it has higher order harmonics. Let us turn our attention to its 3rd harmonic, 3LO. During the upconversion of the baseband signal via mixing, this 3LO component mixes with the BB component and creates a frequency component at 3LO-BB. Both the LO+BB and 3LO-BB components are present at the input to the PA. As a result of the PA's nonlinearity, they intermodulate, and a component at LO-3BB is generated (the LO+BB component mixes with the 3LO-BB component and forms the LO-3BB component by the formula (3LO-BB) - [2 x (LO+BB)] = LO-3BB). If not properly filtered, the LO-3BB component looks like noise to the desired signal at LO+BB. The above is shown in Figure 9.26.
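The frequency bookkeeping is easy to get wrong by hand, so a short check may help. In the sketch below the LO and baseband frequencies (900 MHz and 100 kHz, expressed in kHz as integers) are hypothetical values picked only to make the arithmetic concrete.

```python
def third_order_product(f_a, f_b):
    """Frequency of the third-order product |f_b - 2*f_a| of two tones."""
    return abs(f_b - 2 * f_a)


LO_KHZ = 900_000   # hypothetical LO frequency, in kHz
BB_KHZ = 100       # hypothetical baseband tone, in kHz

desired = LO_KHZ + BB_KHZ              # wanted upconverted signal
harmonic_term = 3 * LO_KHZ - BB_KHZ    # from the LO 3rd harmonic
spur = third_order_product(desired, harmonic_term)
offset = desired - spur                # distance of the spur from the signal

print(f"spur at {spur} kHz, which is LO - 3BB = {LO_KHZ - 3 * BB_KHZ} kHz")
print(f"offset from the desired signal: {offset} kHz")
```

The spur sits only 4 x BB away from the desired signal, which is why it cannot easily be removed by the filter that follows the PA.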

From (9.7), Ain = 5 dBm. We now calculate the allowed IIP3.

We treat the PA as a nonlinearity whose input signal has two frequency components, ω1 (= LO+BB) and ω2 (= 3LO-BB), and denote their amplitudes as a1@ω1 and a2@ω2.

Thus: a1@ω1 = Ain = 5 dBm   (9.16)

We now assume the LO is a square wave, so that the amplitude of the 3LO component is approximately 1/3 of the amplitude at LO. We further assume that the upconverted baseband signals follow the same ratio, so that:

a2/a1 = 1/3.   (9.17)
Now ω1 intermodulates with ω2 to form the ω2-2ω1 component. To find the amplitude of this component, let us represent the PA as:

v_{out}(t) = \alpha_0 + \alpha_1 v_{in}(t) + \alpha_2 v_{in}^2(t) + \alpha_3 v_{in}^3(t)   (9.18)

The two components pass through the nonlinear PA and, from (9.18), the amplitudes of the relevant frequency components of vout are:

Amplitude of the ω1 component at the output = α1 x a1 x Ain
Amplitude of the intermodulation component at the output = (3/4) x α3 x a2 x Ain x (a1 x Ain)^2

The ratio of these two amplitudes should be the required SNR, i.e.

20 \log \frac{(3/4) \alpha_3 a_2 A_{in} (a_1 A_{in})^2}{\alpha_1 a_1 A_{in}} = SNR   (9.19)

To find SNR we start from (9.14), which states that the phase accuracy of the modulated carrier is 5° rms. In general, noise causes the received vectors to deviate from the reference vector. The differences, called the error vectors, have random amplitude. Numerous ways to characterize the amplitude of the error vectors (which depends on SNR) have been reported, and a series of definitions of the error vector magnitude (EVM) is given in [10]. For ease of explanation, we go through the following approximations to relate the amplitude of the error vectors to SNR, and hence to the transmit phase error. A transmit phase error of 5 degrees rms means that, in the worst case, the transmitted vectors (which are random) are allowed to deviate from the reference vector by 5 degrees in an rms sense. To simplify the SNR calculation, let us assume that, in an rms sense, the transmitted vectors and error vectors are represented by a single transmitted vector and a single error vector, henceforth called the transmitted vector and the error vector. The transmitted vector is constructed by rotating the reference vector by 5 degrees (clockwise or counterclockwise). We further assume the error vector is given by the vectorial difference between the transmitted vector and the reference vector. Typically the magnitude of the error vector is small compared to that of the reference vector, and thus the magnitudes of the transmitted and reference vectors are about the same. Hence the transmitted vector can be viewed as obtained from the reference vector simply by rotation, with the reference vector serving as the radius of the circle. Furthermore, since the rotation angle is small (5 degrees is small compared to 360 degrees), the magnitude of the error vector is approximated by the length of the arc between the tips of the transmitted and reference vectors. Let us now denote the error vector's amplitude as amp_error and the transmitted vector's amplitude as amp_xmit. Since the radius of the circle is given by amp_xmit and the length of the arc by amp_error, by definition we have:

amp_error = [(5 degree/360 degree) x (2π)] x amp_xmit   (9.20)
Fig. 9.27 Power level ramping for GSM

We now further assume SNR, in dB, is roughly given by the ratio of amp_xmit to amp_error, in a log sense. Thus

SNR = 20 x log(amp_xmit/amp_error) = 20 x log[(360 degree/5 degree)/(2π)] = 21 dB   (9.21)
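The arc-length approximation of (9.20) and the resulting SNR can be evaluated directly. The sketch below is a minimal illustration under the same small-angle assumption; the function name is ours.

```python
import math


def snr_db_from_phase_error(phase_error_deg):
    """SNR (dB) implied by an rms phase error, using the arc approximation.

    amp_error / amp_xmit is taken as the rotation angle in radians, as in (9.20).
    """
    error_over_xmit = math.radians(phase_error_deg)
    return 20.0 * math.log10(1.0 / error_over_xmit)


snr_db = snr_db_from_phase_error(5.0)
print(f"5 degree rms phase error -> SNR of about {snr_db:.1f} dB")
```

Because the angle enters through a logarithm, halving the allowed phase error raises the required SNR by about 6 dB.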


Substituting (9.19) in (9.21), we have:

20 \log \frac{(3/4) \alpha_3 a_2 A_{in} (a_1 A_{in})^2}{\alpha_1 a_1 A_{in}} = 21 dB   (9.22)

Substituting (9.16)-(9.17) in (9.22), we can find the ratio α1/α3, which is simply IM3, as:

IM3 = 0.267, or about -11 dB   (9.23)

Finally we have:

IIP3|dBm = Ain|dBm - (IM3|dB)/2 = 10.5 dBm   (9.24)
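The last two steps are plain dB arithmetic and can be verified as follows. This is a sketch using the values quoted in (9.7), (9.23) and (9.24); the function name is illustrative.

```python
import math


def iip3_dbm(a_in_dbm, im3_db):
    """IIP3 from the input level and the relative IM3 level, as in (9.24)."""
    return a_in_dbm - im3_db / 2.0


im3_exact_db = 20.0 * math.log10(0.267)   # (9.23) expressed in dB
iip3_rounded = iip3_dbm(5.0, -11.0)       # using the rounded -11 dB
iip3_exact = iip3_dbm(5.0, im3_exact_db)  # using the unrounded ratio

print(f"IM3 = {im3_exact_db:.2f} dB")
print(f"IIP3 = {iip3_rounded} dBm (rounded IM3), "
      f"{iip3_exact:.2f} dBm (unrounded IM3)")
```

The rounding of IM3 to -11 dB changes the result by only a few tenths of a dB.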


Fig. 9.28 PA power control block level implementation

Fig. 9.29 PA power control characteristic


9.4.2    Power  Output  Control 


To achieve a given power output level, specs typically include one for power level ramping. Shown in Figure 9.27 are the specs for GSM.

Notice that there are also power control specs. For GSM the range is from 33 dBm down to 5 dBm output power, or a 28 dB range, with accuracy to ±3 dB at lower power levels. Shown in Figure 9.28 is a block diagram illustrating how power control, in general, is achieved.

In essence, power control is achieved by sensing the output power via the RF detector (basically extracting the envelope of the carrier), comparing it with a control signal (Vcontrol) and feeding the result back to vary the gain of the PA. A typical Pout versus Vcontrol characteristic, or transfer function (TF), is shown in Figure 9.29, where it is seen that as Vcontrol increases, Pout initially increases but eventually levels off. This TF can be quite nonlinear, and a typical variation with respect to Vcontrol is also shown in Figure 9.29. This variation can be a problem in a closed loop circuit (the present case) if it is large. Typically the gain variation can be as much as 100:1, and so loop stability needs to be analyzed carefully.
9.4.3    PA Design Issues

Next we note that, unlike the LNA, which amplifies the signal, a power amplifier (PA) transfers power from dc to RF, rather than amplifying power.

Since Vdd is decreasing for scaled technologies, the required load resistance R decreases. Therefore we need a matching network at the output of the PA. This matching network is similar to the one discussed for the LNA. An example will be shown below.

Most CMOS PAs [22] have the following issues. One issue is oxide breakdown. This limits the available voltage swing at the drain of the transistor. As a result the load impedance seen by the transistor needs to be very low in order to deliver the required amount of power without damaging the device. Furthermore, the current level must rise in proportion to the reduction in voltage swing. Since CMOS has poor transconductance, the transistors can be quite large in order to generate the required output power. In general, we can use a matching network and increase Vdd. Circuit techniques such as cascoding can also be used. Several PAs can also be combined, each with a small Pout.

Another issue is power efficiency. Power added efficiency (PAE) is defined as:

PAE = \frac{P_{out}(RF)}{P_{in}(RF) + P_{dc}}   (9.25)

In other words, it is the ratio of output power to the sum of DC input power and RF input power. Maximum PAE depends on the class of PA. The class A amplifier has the lowest efficiency (50%), with efficiency increasing as one moves to class B (78%) and then class C. On the other hand, the linearity decreases. Because it is linear, class A PA design is more straightforward, while the analysis of the nonlinear class B and class C amplifiers is more involved.

Finally, an integrated PA has more problems with thermal runaway. In particular, if the power level is high, the heat generated may not be dissipated quickly enough. This will significantly change the operating point of the device, and may also cause stability problems.


9.4.4    Class  A  Amplifiers 

First, a class A amplifier [12] is examined. This is shown in Figure 9.30.

LRFC is a choke inductance for biasing. Ls is the parasitic inductance. The matching network can be realized as in [12]. For example, Match 2 in Figure 9.30 transforms RL, the 50 Ω load from the antenna, to R, as shown in Figure 9.31, via the Q of the LC network. This network consists of a series Cs and a parallel Lp.

Another configuration consists of a series Ls and a parallel Cp.

Next, the load line of the PA circuit is given in Figure 9.32.

The  average  dc  power  consumption  is  given  by 


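P_{dc} = I_{DO} \times V_{dd}   (9.26)

The optimum drain resistance is

R_{L\_opt} = \frac{V_{dd}}{I_{DO}}   (9.27)

The maximum RMS power delivered to the load is given by

P_{out} = \frac{V_{dd}}{\sqrt{2}} \times \frac{I_{DO}}{\sqrt{2}} = \frac{V_{dd} \times I_{DO}}{2}   (9.28)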
Fig 9.32 Load line of class A power amplifier


From Figure 9.32, the maximum peak signal current and voltage swing will not be equal to IDO and Vdd, respectively, due to the VDS_sat of M1. The Rs of M1 (due to e.g. contact resistance and interconnect) and Ls (package + bond wire) will also cause voltage drops. Therefore

PA efficiency, \eta = \frac{P_{out}}{P_{dc}} < 50\%   (9.29)


Ground bounce is due to M1's source current flowing into Rs and Ls. Some amount of resistive degeneration in the source is desirable to provide negative feedback and increase stability. The voltage drop across the inductance Ls is undesirable and should be minimized.

Ground bounce:

V_{ground\ bounce} = R_s I_D(\omega) + j\omega L_s I_D(\omega)   (9.30)


Note that the voltage across Ls is in quadrature with the signal current. If the input signal to M1 is sinusoidal, with amplitude Vip, then the drain current equals the source current Is, as shown in Figure 9.33, and is given by:

I_D(\omega) = \frac{V_{ip} \sin(\omega t)}{R_s + j\omega L_s}   (9.31)

and the output voltage by

V_o(\omega) = \frac{R_L V_{ip} \sin(\omega t)}{R_s + j\omega L_s}   (9.32)

Maximum peak to peak output voltage at the drain of M1 is given by

V_{o\_max}(\omega) \approx V_{dd} - V_{DS\_sat} - I_s (R_s + j\omega L_s)   (9.33)
Fig 9.33 Maximum output swing with parasitic resistance, inductance
With the load identified, the stability of the PA has to be checked. S-parameters should be used. Unconditional stability can be ensured if the following conditions [16] are satisfied:

K_s = \frac{1 - |S_{11}|^2 - |S_{22}|^2 + |\Delta|^2}{2 |S_{12} S_{21}|} > 1;   |\Delta| < 1   (9.34)

where Δ = det([S]) = S11S22 - S12S21.

A circuit is defined as unconditionally stable if there is no combination of passive source and passive load that can cause the circuit to oscillate [18].

Alternatively, unconditional stability can be ensured if the following condition [18] is satisfied:

\mu = \frac{1 - |S_{11}|^2}{|S_{22} - S_{11}^{*} \Delta| + |S_{21} S_{12}|} > 1   (9.35)

where again Δ = det([S]) = S11S22 - S12S21.

[18] also provides a geometrical explanation in terms of mappings (e.g. the mapping between input/output reflection coefficient and load/source reflection coefficient) that can help to guide the design in achieving stability.
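Both stability tests are simple to evaluate once the S-parameters at a given frequency are known. The sketch below computes the K factor with |Δ| and the single-parameter μ test from a 2 x 2 S-matrix. The S-parameter values in the example are made up purely for illustration and do not describe any device in this chapter; in practice the check has to be repeated over the whole frequency range of interest.

```python
def stability_factors(s11, s12, s21, s22):
    """Return (K, |delta|, mu) for a two-port at one frequency.

    Assumes s12 * s21 is nonzero (a unilateral device needs separate handling).
    """
    delta = s11 * s22 - s12 * s21
    k = (1 - abs(s11) ** 2 - abs(s22) ** 2 + abs(delta) ** 2) / (
        2 * abs(s12 * s21)
    )
    mu = (1 - abs(s11) ** 2) / (
        abs(s22 - s11.conjugate() * delta) + abs(s21 * s12)
    )
    return k, abs(delta), mu


def is_unconditionally_stable(s11, s12, s21, s22):
    """K > 1 together with |delta| < 1."""
    k, delta_mag, _ = stability_factors(s11, s12, s21, s22)
    return k > 1 and delta_mag < 1


# Hypothetical S-parameters at a single frequency (illustration only).
example = dict(s11=0.3 + 0.1j, s12=0.05 + 0j, s21=2.0 + 0j, s22=0.4 - 0.2j)
k, delta_mag, mu = stability_factors(**example)
print(f"K = {k:.2f}, |delta| = {delta_mag:.3f}, mu = {mu:.2f}")
print("unconditionally stable:", is_unconditionally_stable(**example))
```

For this example both tests agree, as expected, since they are alternative statements of the same condition.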

Numerical  Example  9.3 

Consider  designing  a  class  A  RF  single  stage  PA.  Assume  the  transistor  can  deliver 
2W  output  (challenging  for  CMOS)  with  the  following  parameters: 

Vdd = 4.8 V, IDO = 1 A, Ls = 0.1 nH, Rs = 0.2 Ω, VDS_sat = 0.6 V

From (9.27), we have RL_opt = 4.8 Ω.   (9.36)

Fig. 9.34 Reduction in supply voltage and impact on PA design

If we ignore VDS_sat = 0.6 V and the source voltage drop, the maximum output power would be (from (9.28)):

Pout = 1/2 IDO Vdd = 2.4 W   (9.37)

From (9.29), η = 50%.

At  1GHz,  the  maximum  output  power  would  be  less  than  this,  and  is  given  by: 


P_{out}(\omega) = \frac{1}{2} I_{DO} [V_{dd} - V_{DS\_sat} - I (R_s + j\omega L_s)] = 1.95 W   (9.38)


In  order  to  obtain  higher  power  in  class  A,  the  DC  operating  (bias)  current  IDO, 
would  have  to  be  increased  to  compensate  for  the  extra  voltage  drop  i.e.  the  slope  of 
the  ac  load  line  would  be  increased. 

The  small  signal  voltage  gain  is  given  by  (if  Ls  is  ignored) 


Voltage gain = \frac{R_L}{R_s} = \frac{4.8}{0.2} = 24 = 27.6 dB   (9.39)
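The ideal numbers of this example follow directly from (9.26)-(9.28) and the RL/Rs gain expression. The sketch below recomputes them; the variable names are ours and the calculation deliberately ignores VDS_sat, Ls and device capacitance, as the ideal case in the text does.

```python
import math

# Parameters of Numerical Example 9.3.
VDD = 4.8   # supply voltage, V
IDO = 1.0   # dc bias current, A
RS = 0.2    # parasitic source resistance, ohm

r_l_opt = VDD / IDO                   # optimum load, from (9.27)
p_dc = IDO * VDD                      # dc power, from (9.26)
p_out_ideal = VDD * IDO / 2.0         # ideal output power, from (9.28)
efficiency = p_out_ideal / p_dc       # ideal class A efficiency
voltage_gain = r_l_opt / RS           # small signal gain with Ls ignored
voltage_gain_db = 20.0 * math.log10(voltage_gain)

print(f"RL_opt = {r_l_opt:.1f} ohm, Pout = {p_out_ideal:.1f} W, "
      f"efficiency = {efficiency:.0%}")
print(f"voltage gain = {voltage_gain:.0f} = {voltage_gain_db:.1f} dB")
```

A gain of 24 corresponds to 20 log10(24), i.e. about 27.6 dB.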


Thus the G and power level of this design are slightly below the specs of the example PA for GSM (shown in Figure 9.20). The actual voltage gain at 1 GHz is lower due to device capacitance, which has not been included. Nevertheless the large amount of gain at high frequency can be a problem, and the amplifier design needs to be checked for potential instability over frequency.

Plotted in Figure 9.34 is RL versus Vdd. Using (9.27) and (9.28), for a given power output Po, we can derive

R_L(V_{dd}) = \frac{V_{dd}^2}{2 P_o}

Fig. 9.35 Example 3 stage Class A power amplifier

We then plot RL versus Vdd. The result shows that as Vdd decreases, RL decreases. This RL may not be matched to the 50 Ω output. Thus we need a matching network, as discussed above.
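The trend can be tabulated with the relation RL = Vdd^2/(2 Po), obtained by eliminating IDO between (9.27) and (9.28). In the sketch below the output power is kept at the 2.4 W of the numerical example, and the list of supply voltages is a hypothetical set chosen only to illustrate scaling.

```python
def load_resistance(vdd, p_out):
    """Load resistance needed to deliver p_out from supply vdd (ideal class A)."""
    return vdd ** 2 / (2.0 * p_out)


P_OUT_W = 2.4                           # ideal output power of Example 9.3
supplies_v = [4.8, 3.3, 2.5, 1.8, 1.2]  # hypothetical supply voltages

loads = [load_resistance(v, P_OUT_W) for v in supplies_v]
for v, r in zip(supplies_v, loads):
    print(f"Vdd = {v:.1f} V -> RL = {r:.2f} ohm "
          f"(transformation ratio from 50 ohm: {50.0 / r:.0f}:1)")
```

The required load falls with the square of the supply voltage, so the impedance transformation ratio demanded of the matching network grows quickly at low Vdd.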

An example core (without the matching network) class A PA [3] design is given in Figure 9.35. This consists of three stages to give it the necessary gain. The first stage is a differential pair and operates in class A. Cascode devices are used to mitigate the breakdown problem, since the voltage supply is 3.3 V. The gates of these devices are connected to the power supply. The output is fed into L1n, L1p, which serve as inductive loads. They are on-chip inductors. The signal is then ac-coupled via C1n, C1p to the second stage (M2n, M2p) and then into the third stage (M3n, M3p). The second and third stages are common source amplifiers, and are biased through the resistors to keep them in class A operation. Level shifting between stages is achieved through capacitors C1n-p to C3n-p. L4n, L4p are on-chip inductors, added to resonate with the gate capacitances of M3n, M3p. This allows the level shifting capacitors C2n-p to be kept below 2 pF. There is also a power control loop (not shown), performing the power control function described in 9.4.2. This loop, like Figure 9.28, includes an RF peak signal detector, a comparator and 24 dB of adjustable transmitter gain. The power is adjusted in steps of 0.5 dB, as shown in Figure 9.29. Vout+ and Vout- are fed to an off-chip balun. This PA delivers a 22 dBm peak output power and 17.8 dBm average power.
Fig. 9.36 Class AB/B/C power amplifier


9.4.5    Class  AB/B/C  Amplifiers 


To  improve  efficiency,  PA  does  not  need  to  conduct  all  the  time.  This  leads  us  from 
class  A  amplifier  to  class  AB  amplifier.  Thus,  for  example,  the  amplifier  for  class 
A  (Figure  9.30)  is  redrawn  here  (without  matching  network)  in  Figure  9.36. 

As shown, the difference between the classes lies only in the bias condition;
the circuit is the same. On the right-hand side of the diagram is the I-V plot of the
transistor. When the gate voltage (set up via Vgg) is below the threshold voltage,
the drain current is zero. The gate bias voltage for class A operation is marked by
the rightmost dotted line. As the bias voltage moves towards
the left, operation becomes class AB, class B, and then class C. The conduction angle
represents the portion of a cycle during which the PA conducts (i.e., the drain current is
larger than zero). For class A it is 2π or 360°, since the PA conducts all the time. For class B it is
π or 180°, since the PA conducts half the time. Class AB is somewhere in between.
Class C is below π or 180°. As the conduction angle goes down, the amplifier is
off for part of the cycle. This means the dc power dissipation, Pdc, is reduced
and hence efficiency increases, according to (9.29). It also means, however, that the PA
is more nonlinear. In the GSM system, in spite of this nonlinearity, it is
possible to operate the PA in class AB mode and thus obtain better efficiency than
in pure class A operation; this is possible because of the nature of the modulation and
the resulting constant RF envelope.
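
The link between bias point, conduction angle, and efficiency can be sketched numerically. The block below is a minimal illustration, not a restatement of (9.29): it uses the standard idealized reduced-conduction-angle result, which assumes a piecewise-linear transistor, a sinusoidal gate drive, full output voltage swing, and harmonics shorted at the drain. The function names and the bias, threshold, and drive values are assumptions chosen only for illustration.

```python
import math

def conduction_angle(v_bias, v_th, v_amp):
    """Conduction angle in radians for a gate drive v_bias + v_amp*cos(wt).

    Assumes the device conducts whenever the gate voltage exceeds v_th.
    """
    x = (v_th - v_bias) / v_amp
    x = max(-1.0, min(1.0, x))  # -1: always on, +1: never on
    return 2.0 * math.acos(x)

def ideal_drain_efficiency(theta):
    """Ideal drain efficiency for conduction angle theta (0 < theta <= 2*pi)."""
    half = theta / 2.0
    numerator = theta - math.sin(theta)
    denominator = 4.0 * (math.sin(half) - half * math.cos(half))
    return numerator / denominator

def pa_class(theta, tol=1e-6):
    """Name the operating class from the conduction angle."""
    if theta >= 2.0 * math.pi - tol:
        return "A"
    if theta > math.pi + tol:
        return "AB"
    if theta >= math.pi - tol:
        return "B"
    return "C"

if __name__ == "__main__":
    V_TH, V_AMP = 0.7, 0.5  # assumed threshold and drive amplitude (volts)
    for v_bias in (1.2, 0.95, 0.7, 0.45):  # assumed bias points, moving left
        theta = conduction_angle(v_bias, V_TH, V_AMP)
        eta = ideal_drain_efficiency(theta)
        print(f"Vgg={v_bias:.2f} V  angle={math.degrees(theta):6.1f} deg  "
              f"class {pa_class(theta):2s}  ideal efficiency={eta:.3f}")
```

Under these assumptions the ideal efficiency is 50% at a conduction angle of 2π (class A) and π/4, about 78.5%, at π (class B), and it keeps rising as the angle shrinks further into class C.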

Note that so far, whether the PA is class A, B, or C, the load remains resistive.
This means the amplifier (without biasing) is similar to Figure 9.33, where the
load consists of R0. Output power and efficiency can be improved, though, by
using a more sophisticated load, such as one with harmonic tuning [6]. Basically, this
consists of selecting the load so that it becomes a short circuit at the harmonic frequencies.
Such a load allows us to maximize the fundamental-frequency
voltage and current swings simultaneously. Thus, whereas with a resistive load in class B or C
operation the drain current is clipped, as shown in Figure 9.36, with a tuned load
the fundamental component of the drain current remains almost a full sinusoid.
This maximizes the fundamental current swing (the high-frequency components of the current
remain distorted, though).

As a side note, the concept of class A, AB, and C operation also appears in
the design of output drivers, where efficiency and nonlinearity
are likewise relevant. In an output driver, however, low output impedance is the
design objective, which is not necessarily true in PA design (where the output
impedance needs to be matched).

Let  us  now  discuss  an  actual  implementation  of  a  class  C  PA. 

As stated above, here the transistor is biased such that it is off for more than
one-half of the cycle by setting the dc bias below the threshold voltage of the device.
An example class C PA core (without the matching network) [5] is shown in
Figure 9.37. It is a three-stage fully differential design. The stages are ac-coupled
so that the input transistor of each stage can be biased independently of the output
of the previous stage. Since the PA was to be used in an integrated environment, the
preceding blocks would likely not be able to drive a heavy load. Therefore three stages
were used, so that the first stage would not load down the output of the mixer and
the chain would still provide the necessary gain. The first stage is a differential pair and
operates in a class A fashion, with a variable tail current source that controls the gain and thus the
output power level. The final two stages are also differential pairs, but with the tail node
connected directly to ground rather than to a current source. Thus, by setting the proper
bias at the gates of the input differential pair, they can be biased to be on for less than
half of the cycle, i.e., in class C operation. The second and third stages also use a cascode
structure, with the top transistor biased at the supply, in order to reduce the stress on the
oxide due to voltage excursions at the outputs of those stages. At the input to the final
stage, a different tuning technique was used to peak the gain at the desired frequency
(in much the same way as the narrowband LNA of chapter 3). Due to the high current
drive required and the poor transconductance of CMOS, the output transistor was
very large (18300 µm/0.35 µm). Using a single spiral inductor (as discussed in chapter 3) to
tune out all the capacitance between the second and third stages requires a very small
inductor (on the order of 0.1 nH). This inductor was quite difficult to
implement without parasitic trace inductance affecting the tuning at that node.
Furthermore, inductors were used across the differential cascode node in the second
and third stages, in order to reduce the current required to charge and discharge the
capacitors at those nodes and boost the efficiency. The inductors used at the output of
the first and second stages, as well as those at the cascode nodes, were implemented as
spiral inductors on chip. The other inductors used in this circuit were implemented
as bond wires, in order to take advantage of their higher Q. The inputs of the second
and third stages were biased at 0.35 V.

Fig. 9.37    Class C power amplifier
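
To see why the interstage tuning inductor ends up so small, the resonance condition L = 1/((2πf)²C) can be evaluated directly. The sketch below is only an order-of-magnitude illustration: the RF frequency and the node capacitance are assumed values, not figures taken from the design in [5], and `resonant_inductance` is a hypothetical helper name.

```python
import math

def resonant_inductance(c_farads, f_hz):
    """Inductance that resonates with c_farads at f_hz: L = 1 / ((2*pi*f)^2 * C)."""
    omega = 2.0 * math.pi * f_hz
    return 1.0 / (omega ** 2 * c_farads)

# Assumed illustrative values (not from the source design).
F_RF_HZ = 1.9e9      # assumed RF frequency
C_NODE_F = 70e-12    # assumed total capacitance at the interstage node

l_tune = resonant_inductance(C_NODE_F, F_RF_HZ)
print(f"Tuning inductance: {l_tune * 1e9:.3f} nH")
```

With tens of picofarads at a node and a carrier near 2 GHz, the required inductance falls to roughly a tenth of a nanohenry, which is comparable to the inductance of a short interconnect trace.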


9.4.6    Choice of Class A vs AB/C Amplifiers


Many issues, including efficiency, output power level, and required linearity,
determine this choice. One important consideration is whether the
required linearity allows a class AB/C PA or demands a class A PA. We again
use the required IIP3 derived above for the architecture in Figure 9.20 as an example.
For this architecture, from (9.15), an IIP3 of around -17 dBm is required. A class
AB/C amplifier, even though nonlinear, can typically satisfy this requirement.
With its better efficiency, it is an attractive choice.

Such a class C PA has been designed and simulated in 0.35 µm CMOS technology.
The core topology is similar to that in Figure 9.37, with W/L of the first, second, and third stages
given as 800 µm/0.35 µm, 2500 µm/0.35 µm, and 18300 µm/0.35 µm, respectively. As the W/L
ratios become large, a single large transistor is laid out as identical smaller
transistors connected in parallel. The current for the first stage is 20 mA. The bias
at the second and third stages is set at 0.35 V. The resulting IIP3 is -8 dBm, which satisfies the
requirement. Efficiency is 69%.

Next, with this architecture, when we take into account additional circuit
nonidealities, such as a distorted LO, the required IIP3 from (9.24) is 10.5 dBm. This is higher,
and so the PA has to be more linear.

The IIP3 of a typical class AB/C PA is lower (for example, the above class C amplifier,
with an IIP3 of -8 dBm, does not meet the requirement), and so it cannot be
used directly. A class A amplifier's IIP3 can typically satisfy this requirement,
so it can be applied directly, at the expense of lower efficiency.

Such a class A PA has been designed and simulated in 0.18 µm CMOS technology.
The core topology is like the one in Figure 9.35, with (W/L)1n-p = 800 µm/0.18 µm,
(W/L)2n-p = 2500 µm/0.18 µm, and (W/L)3n-p = 5000 µm/0.18 µm. The cascode devices are set to the same
size as the drive devices. Itail for the first stage is 14 mA. The bias at the second and third
stages is set at 0.5 V and 0.9 V, respectively. The resulting IIP3 is 13 dBm. This is more linear than
the IIP3 of -8 dBm achieved with the class C PA above, and is linear enough that
the PA satisfies the requirement. Efficiency is only 28%, which is poorer than
the 69% of the class C PA.

If power efficiency is a requirement, we may have to revert to a
nonlinear class AB/C PA. What about meeting the IIP3 requirement? Remember that in
deriving the IIP3 specs, specifically in 9.4.1, we simplified Figure 9.20 so that
the BPF is ideal with a loss of 0 dB, and thus is assumed to have a flat frequency
response. In practice this filter, which is before the PA, serves as a pre-PA RF filter
[5] and attenuates the frequency component at 3LO-BB. If we implement it as a
two-pole filter, we can provide 19 dB of suppression at 3LO-BB. Then the required
IIP3 of 10.5 dBm is relaxed by 19 dB to -8.5 dBm. Again a class AB/C amplifier can
typically satisfy this requirement (for example, the class C PA designed and
simulated above, with an IIP3 of -8 dBm, satisfies it). Thus there
is a trade-off between the design of the PA (e.g., class A versus class C) and the design of the filter
(e.g., one-pole versus two-pole). This filter, however, is likely to be a discrete filter, because
components integrated in CMOS have low Q, and is thus costly. An alternative
modification is to use the harmonic rejection transmitter architecture [5],
which does not require this filter. This architecture rejects the 3LO component, so less
intermodulation product is present, thus reducing the required IIP3.
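
The comparison running through this section can be collected into a small budget check. The sketch below uses only the figures quoted in the text (required IIP3 of -17 dBm and 10.5 dBm, 19 dB of pre-PA filter suppression, and the simulated IIP3 and efficiency of the two PAs). It assumes, as the text does, that the filter suppression relaxes the required IIP3 dB for dB; the function and variable names are illustrative.

```python
def relaxed_requirement(required_dbm, filter_suppression_db):
    """Required IIP3 after the pre-PA filter attenuates the 3LO-BB component.

    Assumes a dB-for-dB relaxation, following the reasoning in the text.
    """
    return required_dbm - filter_suppression_db

def meets_iip3(pa_iip3_dbm, required_dbm):
    """A PA meets the spec when its IIP3 is at least the required value."""
    return pa_iip3_dbm >= required_dbm

# Figures quoted in this section.
REQ_IDEAL_LO_DBM = -17.0          # from (9.15)
REQ_DISTORTED_LO_DBM = 10.5       # from (9.24)
FILTER_SUPPRESSION_DB = 19.0      # two-pole pre-PA filter at 3LO-BB

PAS = {
    "class C, 0.35 um": {"iip3_dbm": -8.0, "efficiency": 0.69},
    "class A, 0.18 um": {"iip3_dbm": 13.0, "efficiency": 0.28},
}

SCENARIOS = {
    "ideal LO": REQ_IDEAL_LO_DBM,
    "distorted LO": REQ_DISTORTED_LO_DBM,
    "distorted LO + 2-pole filter": relaxed_requirement(
        REQ_DISTORTED_LO_DBM, FILTER_SUPPRESSION_DB),
}

for scenario, required in SCENARIOS.items():
    for name, pa in PAS.items():
        ok = meets_iip3(pa["iip3_dbm"], required)
        print(f"{scenario:30s} need {required:6.1f} dBm | {name}: "
              f"IIP3 {pa['iip3_dbm']:5.1f} dBm, eff {pa['efficiency']:.0%} -> "
              f"{'OK' if ok else 'fails'}")
```

Laid out this way, the trade-off is visible at a glance: the class C PA passes only when the LO is ideal or the filter is present, while the class A PA passes everywhere at less than half the efficiency.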


9.4.7    Class  E  Amplifiers 

To improve efficiency even further, one can use a switching-mode PA. Even though
there are a variety of switching-mode PAs (e.g., class D, class S), the vast majority of
attention has been devoted to the class E scheme, because it is simple to implement and also
has simple design relationships [6, 21]. Shown in Figure 9.38 is a conceptual picture
of a class E PA [21].

The transistor is represented as a switch. Cp includes the transistor's output parasitic
reactance, the stray capacitances, and an explicit capacitor. Its value should be set
so as not to short circuit all the voltage harmonics. Ls and Cs form a resonant circuit that
acts as a bandpass filter, designed so that its center frequency is the RF
frequency. The load is represented by RL (following [21], we assume a resistive
load only, although in practice there can be an inductance in series with it [6]).
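
As one example of how simple class E design relationships can be, the sketch below evaluates the classic idealized equations (ideal switch, RF-choke supply feed, 50% duty cycle, high-Q series filter) commonly attributed to Sokal and Raab. These are offered as a hedged illustration and may not be the exact form used in [6] or [21]; in particular they include a small excess series reactance, whereas the simplified picture above assumes a purely resistive load. The function name and the example supply, power, and frequency are assumptions.

```python
import math

def class_e_design(v_dd, p_out, f_rf):
    """Idealized class E element values for supply v_dd (V), output power
    p_out (W), and operating frequency f_rf (Hz).

    Assumes an ideal switch, an RF-choke feed, and a 50% duty cycle.
    """
    omega = 2.0 * math.pi * f_rf
    k = math.pi ** 2 + 4.0
    r_load = (8.0 / k) * v_dd ** 2 / p_out                    # ~0.5768 * Vdd^2 / P
    c_p = 8.0 / (math.pi * k) / (omega * r_load)              # ~0.1836 / (w * R)
    x_excess = math.pi * (math.pi ** 2 - 4.0) / 16.0 * r_load  # ~1.1525 * R
    return {
        "r_load_ohm": r_load,
        "c_p_farad": c_p,
        "x_excess_ohm": x_excess,
        "l_excess_henry": x_excess / omega,
        "v_switch_peak": 3.562 * v_dd,  # ideal peak switch voltage
    }

# Assumed example: 2.5 V supply, 1 W output, 1.9 GHz carrier.
design = class_e_design(v_dd=2.5, p_out=1.0, f_rf=1.9e9)
for name, value in design.items():
    print(f"{name:16s} = {value:.4g}")
```

The peak switch voltage of roughly 3.6 times the supply in this idealized case is one reason oxide stress has to be watched in a CMOS implementation.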

To explain the operation, let us start with Vin, a square wave at the RF frequency,
as shown in Figure 9.39. It therefore opens and closes the switch at the RF
frequency. During the ON phase, Vs is shorted to ground and stays there.
Since Vs remains constant, the voltage across the capacitor Cp, Vc (= Vs), remains
constant, and from the relation $I_C = C_p\,(dV_C/dt)$ we have Ic = 0. Now jumping to the
output, since Ls and Cs form a bandpass filter with center frequency at the RF
frequency, we assume the output voltage Vo contains just the fundamental
at the RF frequency, i.e., all harmonics are filtered and Vo is just a
sinusoid. Let us further assume the phase of the sinusoid is set so that it starts
from negative, as shown in Figure 9.39. Since Vo and Vin are designed to have the
same frequency, Vo stays negative during the ON phase. Since Vo is developed
across a resistive load RL only, the current Io has the same phase and stays negative.
Next we observe that VL, the voltage across the inductor, which equals Vdd - Vs, is
simply Vdd, since Vs was already shown to be zero during this ON phase. Thus from
the relation $I_L = \int (V_L/L)\,dt = \int (V_{dd}/L)\,dt$, IL increases linearly.
Then the current through the switch, Is, given as $I_s = I_L - I_C - I_o = I_L - I_o$, rises above zero
and stays above zero, as shown.

Fig. 9.39    Timing diagram

During the OFF phase, the switch is open and Is = 0. The current is then redirected so that Ic
jumps to a positive value and charges up the capacitor. Thus Vs (= Vc), from the
relation $V_C = \int (I_C/C_p)\,dt$, starts to rise from 0. When Vs increases above the power
supply Vdd, though, the voltage drop across L turns negative, and from the relation
$I_L = \int (V_L/L)\,dt = \int ((V_{dd} - V_s)/L)\,dt$, IL starts to decrease, as shown. Meanwhile Vo rises
above zero, and since Vo and Vin are designed to have the same frequency,
Vo stays positive during the OFF phase. The current Io then also rises above
zero and stays positive.

In summary, consider the power dissipated in the switch of the PA. During
the ON phase its voltage Vs = 0, and during the OFF phase its current Is = 0.
Therefore at no point in time are both values nonzero. That means the power,
which equals the product of the two, is zero all the time. This is the reason why the class
E PA has high efficiency.
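
The relations used in the two phases can be collected in one place. The block below is only a restatement of the argument above in compact form; the symbols $t_{on}$ and $t_{off}$ (the instants at which each phase begins) and $P_{sw}$ (the instantaneous power in the switch) are introduced here for convenience and do not appear in the original discussion.

```latex
\begin{align*}
\text{ON phase (switch closed):}\quad
 & V_s = 0, \qquad I_C = C_p \frac{dV_s}{dt} = 0, \\
 & I_L(t) = I_L(t_{on}) + \frac{V_{dd}}{L}\,(t - t_{on}), \qquad I_s = I_L - I_o, \\[4pt]
\text{OFF phase (switch open):}\quad
 & I_s = 0, \qquad I_C = I_L - I_o, \\
 & V_s(t) = \frac{1}{C_p}\int_{t_{off}}^{t} I_C\,d\tau, \qquad
   I_L(t) = I_L(t_{off}) + \frac{1}{L}\int_{t_{off}}^{t} \bigl(V_{dd} - V_s\bigr)\,d\tau, \\[4pt]
\text{Switch power:}\quad
 & P_{sw}(t) = V_s(t)\,I_s(t) = 0 \quad \text{in both phases.}
\end{align*}
```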

In practice, we can phase or frequency modulate Vin. Thus the switch is not just
supplying power from the supply to the load, but also amplifying, via modulation, the
input signal to the output.


9.5     Problems

9.1  A modified direct conversion architecture is shown in Figure P9.1, where the
oscillator frequency is offset from the carrier frequency (set equal to
fLO1 ± fLO2). (a) What is the advantage over the direct conversion architecture? (b)
What is the difference from an indirect conversion architecture?

9.2  (a) For the direct conversion architecture, explain how the pre-PA filter
reduces potential third-order intermodulation in the PA.
(b) For the indirect conversion transmitter architecture, explain its
performance advantages over the direct conversion transmitter, apart from what
is given in the chapter.

9.3  For the harmonic rejection transmitter, explain how it offers an advantage over
the indirect upconversion transmitter in terms of reducing the potential
third-order intermodulation in the PA.

9.4  What are some of the potential drawbacks of using passive methods, such as an
RC filter or a polyphase filter, as the quadrature LO generator?

9.5  For the differential RC quadrature LO generator, derive the relation between
R1, $\omega_{LO}$, C, and R2 if LO_I and LO_Q have equal amplitudes.

9.6  What  are  some  of  the  loss  mechanisms  of  a  polyphase  filter  based  quadrature 
LO  generator?  A  quadrature  LO  generator  is  shown  in  Figure  P9.2  that  tries  to 
combat  this.  How  does  it  minimize  some  of  the  loss? 


Fig. P9.1    Modified direct conversion architecture (figure labels: LO1, LO2)

Fig. P9.2    Quadrature LO generator

9.7  In a quadrature LO generator, because of mismatch in the circuits, there are
amplitude and phase mismatches between LO_I and LO_Q. Assume the amplitude and
phase mismatch is represented as follows:

$I:\ \cos(\omega_1 t), \qquad LO_I:\ \cos(\omega_2 t)$

$Q:\ (1+A)\,\sin(\omega_1 t + \phi), \qquad LO_Q:\ \sin(\omega_2 t)$

The amplitude error (A) and phase error ($\phi$) are included in the quadrature signal path;
all amplitudes are normalized to 1.

a) Show that this results in frequencies at $\omega_1 + \omega_2$ (undesired sideband) and
$\omega_1 - \omega_2$ (desired sideband), as shown in Figure 9.22. Give an expression for the
ratio of the desired to the undesired sideband, $S(A, \phi)$.

b) Plot S (take 10 log of it) in dB as a function of A and $\phi$. A varies from 0 dB to
3 dB. $\phi$ varies from 0.1° to 100° (in log scale). Thus this is a set of parametric
plots on a linear-log plot, with S (dB) on the y-axis (linear), $\phi$ on the x-axis (log), and
A as the parameter.

9.8  In a PA, the matching network shown in Figure P9.3 (which consists of a
series L, parallel C network) is provided to match 50 Ω to 10 Ω.

Fig. P9.3    Matching network (figure labels: V1, 50 ohm, 6.36 nH, 12.7 pF, 10 ohm)

Plot the frequency response of the above circuit, i.e., 20 log(V2/V1) in dB
versus frequency (linear-log plot). The frequency span should be from 10 MHz to
1.5 GHz.

From the frequency response, at what frequency does the matching occur?
The frequency response of the above circuit can be approximated by the
frequency response of the circuit shown in Figure P9.4. Plot the
frequency response of this approximated circuit, i.e., 20 log(V2/V1).


Fig. P9.4    Circuit approximating matching network in frequency response (figure label: 50 ohm)


Compare the two frequency plots in terms of (a) center frequency, (b) bandwidth,
(c) Q, and (d) general shape.

9.9  Calculate the p-p sinusoidal input voltage for HD3 = 1% in the circuit shown
in Figure P9.5.
FET data:

NMOS: LD=0.1V, VTO=0.7, GAMMA=0.5, PHI=0.6, KP=70E-6, LAMBDA=0.03,
CGDO=0.3E-9, CGSO=0.3E-9, CJ=0.4E-3, MJ=0.5, TOX=25E-9, CJSW=0.4E-9,
MJSW=0.5, PB=0.6

PMOS: LD=0.1V, VTO=0.7, GAMMA=0.5, PHI=0.6, KP=70E-6, LAMBDA=0.03,
CGDO=0.3E-9, CGSO=0.3E-9, CJ=0.4E-3, MJ=0.5, TOX=25E-9, CJSW=0.4E-9,
MJSW=0.5, PB=0.6

PMOS WELL: CJSO (fF) = 0.1 × (bottom area in µm²) + 1.1 × (perimeter in µm)
MOS dimensions: (layout figure; labels: S, G, D, W, L, 3 µm spacings, well for PMOS)

Fig. P9.5    Circuit used to investigate distortion in one stage of PA (figure labels: two devices with W/L = 3/2, 10K, 5V, 0.1mA, V0)

Recalculate if the W/L ratio of M1 and M2 is changed to 30000/2. Show that it is
impractical to achieve the same HD3.


Fig. P9.6    Circuit used to investigate distortion in cascading stages in a PA (figure labels: VDD = 5V; M1 3/150; M2 1/2; M3 10/2; M4 10/2; M5 20/2; 10K; Vi; Vo)


9.10  For the circuit shown in Figure P9.6, the dc voltage Vi is adjusted for Vo =
2 V dc.

a) Calculate the bias ID and VGS for each device.

b) Calculate HD2 and HD3 (in dB) in V1 and Vo for a sinusoidal signal
voltage of 0.2 V rms at Vo. FET data are as in 9.9, but add Rs = 25 kΩ/W(µm)
in series with each source. Neglect charge storage and the Early effect.
Hint: Start by deriving a power series for Vo as a function of V1. Then
derive V1 as a function of Vi. Neglect body effect in M5 but not in M3.

c) Repeat the problem with the W/L ratio of M5 increased by 4000 times to
80000/2 and the drain resistance Rd decreased by 4000 times to 2.5 ohms.
Also increase I_M5 by 4000 times.